1. How to make sure gradient descent is working correctly
2. How to choose the learning rate α
-- if α is too small: slow convergence.
-- if α is too large: J(θ) may not decrease on every iteration; may not converge.
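
Both points can be checked by recording J(θ) after every iteration and seeing whether it keeps going down. The sketch below is a minimal illustration, not a definitive implementation. It assumes a linear regression model with a squared-error cost, which these notes do not specify. The data set, the helper names (cost, gradient_descent, decreases_every_iteration) and the three α values are hypothetical choices made only for the example.

```python
import numpy as np

def cost(X, y, theta):
    """Squared-error cost J(theta) = 1/(2m) * sum((X @ theta - y)^2) (assumed model)."""
    m = len(y)
    residual = X @ theta - y
    return float(residual @ residual) / (2 * m)

def gradient_descent(X, y, alpha, num_iters):
    """Batch gradient descent; returns theta and J(theta) recorded after each step."""
    m, n = X.shape
    theta = np.zeros(n)
    history = [cost(X, y, theta)]
    for _ in range(num_iters):
        grad = X.T @ (X @ theta - y) / m   # gradient of J with respect to theta
        theta = theta - alpha * grad       # simultaneous update of all parameters
        history.append(cost(X, y, theta))
    return theta, history

def decreases_every_iteration(history):
    """True if J(theta) never goes up from one iteration to the next."""
    return all(b <= a for a, b in zip(history, history[1:]))

# Hypothetical synthetic data: y = 1 + 2x, with a column of ones for the intercept.
x = np.linspace(0.0, 1.0, 20)
X = np.column_stack([np.ones_like(x), x])
y = 1.0 + 2.0 * x

# Illustrative learning rates: one too small, one reasonable, one too large.
for alpha in (0.001, 0.1, 3.0):
    _, history = gradient_descent(X, y, alpha, num_iters=50)
    print(f"alpha={alpha}: final J={history[-1]:.4g}, "
          f"decreases every iteration: {decreases_every_iteration(history)}")
```

On this toy problem the smallest α lowers J(θ) steadily but slowly, the middle value lowers it much faster, and the largest value makes J(θ) grow instead of shrink. Plotting history against the iteration number gives the same diagnosis visually.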