Question regarding learning rate graph from W2 logistic regression lab


As you may remember, gradient descent tries to drive the cost function as close to its minimum as possible, and the learning rate controls the step size. Too large a learning rate can prevent the cost from converging at all (it may oscillate or diverge), while too small a learning rate means convergence takes a very large number of iterations. So a common way to choose the learning rate is to try a range of values such as 0.001, 0.01, and 0.1 when training a model. Prof. Andrew also suggests a finer search: increase the learning rate by roughly a factor of three each time. For example, if you start with 0.001, next try 0.001 × 3 = 0.003, then 0.01, 0.03, 0.1, and so on. If the cost starts to diverge at some value, back off toward the previous smaller one until you find a rate where the cost decreases steadily.
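To make the idea concrete, here is a minimal sketch of that learning-rate sweep on a tiny logistic regression problem. The toy data, iteration count, and the exact list of alphas are my own illustrative choices, not taken from the lab:

```python
import numpy as np

# Toy 2-feature binary classification data (illustrative, not from the lab)
X = np.array([[0.5, 1.5], [1.0, 1.0], [1.5, 0.5],
              [3.0, 0.5], [2.0, 2.0], [1.0, 2.5]])
y = np.array([0, 0, 0, 1, 1, 1])

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def cost(w, b):
    p = sigmoid(X @ w + b)
    return -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))

def run_gd(alpha, iters=1000):
    # Parameters initialized at zero, as in the lab
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(iters):
        p = sigmoid(X @ w + b)
        w -= alpha * X.T @ (p - y) / len(y)   # gradient step for weights
        b -= alpha * np.mean(p - y)           # gradient step for bias
    return cost(w, b)

# Roughly tripling the learning rate each time: 0.001, 0.003, 0.01, ...
for alpha in [0.001, 0.003, 0.01, 0.03, 0.1, 0.3]:
    print(f"alpha={alpha:<6} final cost={run_gd(alpha):.4f}")
```

With the same iteration budget, the larger (but still stable) rates reach a lower cost; if you pushed alpha high enough, the cost would stop decreasing, which is the signal to back off.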

The high initial cost you see at the start of training is expected, because the parameters are usually initialized at zero, so the model starts far from the minimum. The goal is to reach the lowest cost with a learning rate that is stable, and within an achievable number of iterations to keep training time reasonable. As the iterations increase, the cost keeps decreasing, and after some point the per-iteration reduction becomes very small and the curve flattens out, which indicates the model has converged.
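You can see that flattening by recording the cost at every iteration. A minimal sketch, again on made-up 1-D data with an alpha I chose for illustration:

```python
import numpy as np

# Toy 1-feature data (illustrative only)
X = np.array([[0.5], [1.0], [1.5], [2.0], [2.5], [3.0]])
y = np.array([0, 0, 0, 1, 1, 1])

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

w, b = np.zeros(1), 0.0        # zero-initialized parameters -> high starting cost
alpha, history = 0.1, []
for _ in range(2000):
    p = sigmoid(X @ w + b)
    history.append(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))
    w -= alpha * X.T @ (p - y) / len(y)
    b -= alpha * np.mean(p - y)

# Early iterations cut the cost quickly; later ones barely move it.
print(f"cost drop over first 100 iters: {history[0] - history[100]:.4f}")
print(f"cost drop over last 100 iters:  {history[-101] - history[-1]:.6f}")
```

Plotting `history` against the iteration number gives exactly the learning curve from the lab: a steep drop at first, then a long flat tail once the model has converged.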

I am attaching a few of the slides by Prof. Andrew explaining the learning rate and gradient descent. They are self-explanatory. If you are still not satisfied with the answer, do ask!