《Gradient-Based Learning Applied to Document Recognition》
Background knowledge
1. Gradient-based learning
2. Back propagation: gradients can be computed efficiently by propagating from the output to the input. The error is propagated backward and the weights are updated.
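To make item 2 concrete, here is a minimal sketch of one back-propagation step in plain Python. It is an illustration under assumptions, not the paper's implementation: the single linear-plus-tanh module, the squared-error loss, the learning rate, and all names (forward, backward, sgd_step, W, x, target) are hypothetical choices made for this example.

```python
import math

def forward(W, x):
    """Linear module followed by tanh; returns pre-activations a and outputs y."""
    a = [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in W]
    y = [math.tanh(a_i) for a_i in a]
    return a, y

def loss(y, target):
    """Squared error E = 0.5 * sum((y - target)^2) (assumed loss for this sketch)."""
    return 0.5 * sum((y_i - t_i) ** 2 for y_i, t_i in zip(y, target))

def backward(W, x, y, target):
    """Propagate dE/dy from the output back to dE/dW and dE/dx."""
    dE_dy = [y_i - t_i for y_i, t_i in zip(y, target)]
    # tanh'(a) = 1 - tanh(a)^2, and y already holds tanh(a)
    dE_da = [g * (1.0 - y_i ** 2) for g, y_i in zip(dE_dy, y)]
    # Gradient with respect to the weights: outer product of dE/da and the input
    dE_dW = [[g * x_j for x_j in x] for g in dE_da]
    # Gradient with respect to the input: what would be passed to an earlier module
    dE_dx = [sum(W[i][j] * dE_da[i] for i in range(len(W))) for j in range(len(x))]
    return dE_dW, dE_dx

def sgd_step(W, dE_dW, lr=0.1):
    """One gradient-descent update of the weights (lr is an assumed value)."""
    return [[w - lr * g for w, g in zip(w_row, g_row)]
            for w_row, g_row in zip(W, dE_dW)]

# Toy values chosen only for illustration
W = [[0.1, -0.2], [0.4, 0.3]]
x = [1.0, 0.5]
target = [0.5, -0.5]

_, y = forward(W, x)
before = loss(y, target)
dE_dW, dE_dx = backward(W, x, y, target)
W_new = sgd_step(W, dE_dW)
_, y_new = forward(W_new, x)
after = loss(y_new, target)
print("loss before:", before, "loss after:", after)
```

The backward pass reuses the forward output y, so each gradient costs about as much as the forward computation. In a chain of modules, dE_dx would become the incoming gradient for the module before this one.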
Xn is a vector representing the output of the module. Wn is the vect