Why is the learning rate tmp_alpha = 1.0e-2? Other values won’t work. How do you find the ideal combination of number of iterations and alpha?

Try a lot of combinations and compare the final cost for each.

But first, be sure you normalize the data set. This makes the mathematics of gradient descent much more predictable, so it’s easier to find a usable learning rate.
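
Here is a rough sketch of what that could look like, not the course's reference code. It assumes plain linear regression with batch gradient descent and z-score normalization. The data is synthetic, and the helper names (`zscore_normalize`, `gradient_descent`, `sweep`) are made up for illustration.

```python
import numpy as np

def zscore_normalize(X):
    """Rescale each feature column to zero mean and unit standard deviation."""
    mu = X.mean(axis=0)
    sigma = X.std(axis=0)
    return (X - mu) / sigma, mu, sigma

def gradient_descent(X, y, alpha, num_iters):
    """Batch gradient descent for linear regression; returns w, b and the final cost."""
    m, n = X.shape
    w = np.zeros(n)
    b = 0.0
    for _ in range(num_iters):
        err = X @ w + b - y
        w -= alpha * (X.T @ err) / m
        b -= alpha * err.mean()
    err = X @ w + b - y
    return w, b, (err @ err) / (2 * m)

def sweep(X, y, alphas, iter_counts):
    """Run every (alpha, iterations) pair and record the final cost."""
    results = {}
    for alpha in alphas:
        for iters in iter_counts:
            # Silence overflow warnings: a diverging run is an expected outcome here.
            with np.errstate(over="ignore", invalid="ignore"):
                _, _, cost = gradient_descent(X, y, alpha, iters)
            # A run that blew up (inf or nan) is recorded as inf.
            results[(alpha, iters)] = cost if np.isfinite(cost) else np.inf
    return results

# Synthetic data with two features on very different scales (an assumption
# for the demo, not the data set from the assignment).
rng = np.random.default_rng(0)
X = np.column_stack([rng.uniform(500, 3000, 100), rng.uniform(1, 5, 100)])
y = 0.1 * X[:, 0] + 20.0 * X[:, 1] + 50.0

X_norm, mu, sigma = zscore_normalize(X)

# Same sweep helper on raw and on normalized features.
raw_results = sweep(X, y, alphas=[1e-2], iter_counts=[100])
norm_results = sweep(X_norm, y, alphas=[1e-3, 1e-2, 1e-1, 1.0], iter_counts=[100, 1000])

best_alpha, best_iters = min(norm_results, key=norm_results.get)
print("raw features:", raw_results)
print("best on normalized features:", best_alpha, best_iters)
```

On the raw features the large first column makes the update overshoot unless alpha is tiny. After normalization, a much wider range of alpha values converges, so the sweep becomes a matter of picking the fastest one.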
