Probabilistic View: from activation function determine Error Function
By using gradient information minimum can be found in O(W2) steps
• 4,000 steps for 10 x 10 x 10 network with 200 weights
Probabilistic View: from activation function determine Error Function
By using gradient information minimum can be found in O(W2) steps
• 4,000 steps for 10 x 10 x 10 network with 200 weights