Sussillo, David, and L. F. Abbott. “Random walk initialization for training very deep feedforward networks.” arXiv preprint arXiv:1412.6558 (2014). [Citations: 3].
1 Motivation
[Motivation] Gradient vanishing problem.
2 Linear Random Walk Initialization
[Network Form]
[Backprop]
[Simplifications]
• All layers have same width n .
• Initialize each W^(l) from N(0,