Question regarding learning rate graph from W2 logistic regression lab

Anibar_B · July 28, 2023, 6:53am

Greetings, if it’s not a problem, can someone please explain why in this graph, the cost for learning rate 0.01 seems to fluctuate at first but then goes steadily down ? I understand initially it might have made too much of a change leading to the first spike, but I don’t understand why after around 400 iterations, it seems to go on a steady downward slope.

gent.spah · July 28, 2023, 6:57am

It seems that after the 400 it. the cost pointer has stuck in the “valley” and this learning rate is not suffkcient to overshoot the valley so it goes on decreasing rapidly.

Deepti_Prasad · July 28, 2023, 7:43am

Hello,

as you remember the function of gradient descent is to find cost function as minimum as possible with the help of learning rate. But using too large learning rate does not find you convergence in the result of cost function to be minimum and finding too small learning leads you too cause large number iterations. So to choose the right learning rate, one needs to go from 0.001 then 0.01 and 0.1 if you are training a model. Prof. Andrew has also suggested one more way is to increase the learning rate to three time. Eg if you have choose learning rate of 0.001 for a model training, then next choose 0.001 x 3 that 0.003 which will show you the model result with higher learning rate and then keep on reducing the learning rate toward 0.001 to find the proper fit for the cost function.

The initial spike in the model training you will always find because usually parameters are initiated at zero to get the required cost function. The goal is to achieve the least cost function with smaller learning rate and also in achievable iteration to reduce the training time. So as the iteration goes higher, the cost function is also reducing and after some iteration one sees a constant reduction in cost function.

I am attaching few of the images by Prof. Andrew explaining about learning rate, gradient descent. It is self explanatory. If you still are not satisfied with the answer, do ask!!!

Regards
DP

Anibar_B · July 28, 2023, 12:29pm

Thank you !

Topic		Replies	Views
Learning Rate - C1_W2_Lab03 Supervised ML: Regression and Classification week-2	6	536	April 26, 2023
Gradient descent Neural Networks and Deep Learning	4	650	December 15, 2021
Learning rate. - course notes Neural Networks and Deep Learning week-2	10	319	February 2, 2024
Dynamic adjustment of the learning rate Supervised ML: Regression and Classification week-2	2	608	August 27, 2022
Learning rate on Regularization Supervised ML: Regression and Classification week-3	5	338	December 21, 2023

Question regarding learning rate graph from W2 logistic regression lab

Related topics