Why non-linearities are needed:
1. Without non-linearities, a deep neural network can compute nothing more than a linear (affine) transform of its input, no matter how many layers it has.
2. Extra layers can simply be collapsed into a single linear transform: W2(W1 x + b1) + b2 = (W2 W1) x + (W2 b1 + b2).
3. With non-linearities between the layers, adding more layers lets the network approximate more complex functions.
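
The points above can be checked numerically. The sketch below is only an illustration, not a reference implementation: the helper names (`matvec`, `matmul`, `linear_stack`, `relu_stack`, `collapse`) and the toy weights are assumptions made up for this example, and ReLU is assumed as the non-linearity. It builds a two-layer network, folds the two linear layers into one, and shows that the folded layer matches the linear stack but not the ReLU stack.

```python
def matvec(W, x):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def matmul(A, B):
    """Matrix-matrix product for list-of-rows matrices."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def add(u, v):
    return [a + b for a, b in zip(u, v)]

def relu(v):
    return [max(a, 0.0) for a in v]

def linear_stack(x, W1, b1, W2, b2):
    """Two layers with no activation between them."""
    return add(matvec(W2, add(matvec(W1, x), b1)), b2)

def relu_stack(x, W1, b1, W2, b2):
    """The same two layers with a ReLU after the first one."""
    return add(matvec(W2, relu(add(matvec(W1, x), b1))), b2)

def collapse(W1, b1, W2, b2):
    """Fold two linear layers into one: W = W2 W1, b = W2 b1 + b2."""
    return matmul(W2, W1), add(matvec(W2, b1), b2)

# Hypothetical toy weights, chosen so the arithmetic is easy to check by hand.
W1 = [[1.0, -1.0], [-1.0, 1.0]]
b1 = [0.0, 0.0]
W2 = [[1.0, 1.0]]
b2 = [0.0]

W, b = collapse(W1, b1, W2, b2)
x = [2.0, 0.0]

stacked = linear_stack(x, W1, b1, W2, b2)   # two linear layers
single = add(matvec(W, x), b)               # one collapsed layer, same value
with_relu = relu_stack(x, W1, b1, W2, b2)   # differs from both

print(stacked, single, with_relu)
```

With these particular weights the linear stack collapses to the zero map, while the ReLU version computes |x[0] - x[1]|, a function that no single linear layer can represent.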