In the Learning Rate section of the Gradient Descent chapter, we first note that if we set the learning rate alpha too large, the algorithm may not converge. I agree with that.

By the end of the lecture we also note that even with a fixed learning rate, gradient descent will still converge, so we may not need to decrease the learning rate over time.

However, if alpha is large enough to overshoot the minimum and cause divergence, we have no remedy other than decreasing the learning rate.

The last remark about the fixed learning rate sounds contradictory to that, and I wanted to point it out.

Setting a large \alpha can make w bounce around and can even cause it to diverge instead of converging. However, if we can find a value of \alpha that is high enough to move quickly, yet not so high that it makes \frac{\partial J}{\partial w} flip sign (from positive to negative) on every iteration, then it should be okay.
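To see both behaviors concretely, here is a small sketch on an assumed toy cost J(w) = w^2 (my own example, not from the lecture), where \frac{\partial J}{\partial w} = 2w and the update is w \leftarrow w - \alpha \cdot 2w:

```python
def run_gd(alpha, w0=1.0, steps=20):
    """Run plain gradient descent on the toy cost J(w) = w**2."""
    w = w0
    for _ in range(steps):
        grad = 2 * w          # dJ/dw for J(w) = w**2
        w = w - alpha * grad  # each step multiplies w by (1 - 2*alpha)
    return w

# alpha = 0.4: |1 - 2*alpha| = 0.2 < 1, so w shrinks toward the minimum at 0.
print(run_gd(0.4))

# alpha = 1.1: |1 - 2*alpha| = 1.2 > 1, so w flips sign every step and its
# magnitude grows -- this is the bouncing/divergence described above.
print(run_gd(1.1))
```

On this cost, the sign of w (and hence of the gradient) flips every iteration exactly when \alpha > 0.5, which is the overshoot regime; divergence only sets in once \alpha > 1.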

Coming to your second point about keeping the learning rate fixed: once we identify a value of \alpha that steadily brings w to convergence, we don't need to worry about reducing \alpha. As w approaches the minimum, \frac{\partial J}{\partial w} drops to very small values, so the update \alpha \cdot \frac{\partial J}{\partial w} keeps shrinking on its own, even though \alpha itself stays fixed.
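A quick sketch of this shrinking-update effect, again on an assumed toy cost J(w) = (w - 3)^2 with the minimum at w = 3 (my own example):

```python
alpha = 0.1   # fixed learning rate, never decreased
w = 10.0      # starting point

for i in range(25):
    grad = 2 * (w - 3)   # dJ/dw for J(w) = (w - 3)**2
    step = alpha * grad  # the actual update, alpha * dJ/dw
    w = w - step
    if i % 8 == 0:
        # the step size keeps dropping even though alpha is constant,
        # because the gradient itself vanishes near the minimum
        print(f"iter {i:2d}: w = {w:.5f}, |step| = {abs(step):.6f}")
```

Each iteration multiplies the distance to the minimum by (1 - 2\alpha) = 0.8 here, so both the gradient and the effective step decay geometrically without any schedule on \alpha.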