Lecture comment on choosing optimal learning rate

Roland_Schuetz · November 7, 2022, 3:31pm

In the last video before the quiz, “Deep Neural Network”, Laurence makes the comment about choosing the optimal learning rate: "In this case it looks to be about two notches to the left of 10 to the minus 5.
So I’ll say it’s 8 times 10 to the minus 6, or thereabouts. "

How does one get from 10**-5 to 8*10**-6? Where does this offset come from?

balaji.ambresh · November 7, 2022, 5:02pm

As you might’ve observed in the figure, 10^{-5} is the point beyond which the loss gets unstable.

8*10^{-8} lies well inside the low loss region that’s less than 10^{-5} and loss is relatively stable. There is no rule on what value to pick. The goal is to select a learning rate based on the graph. You could try with rates like 9*10^{-6} as well.

Roland_Schuetz · November 7, 2022, 5:40pm

Thank-you for clarifying.

Topic		Replies	Views
Selecting optimal learning rate from Learning rate scheduler Sequences, Time Series and Prediction week-module-2	1	15	February 12, 2025
C4 W2 \| Deep Neural Network \| Selection of lr value Sequences, Time Series and Prediction week-module-2	7	628	October 24, 2021
DNN video \| Choosing new learning rate (1:58) Sequences, Time Series and Prediction week-module-2	1	538	October 20, 2021
Learning Rate Tuning techniques Sequences, Time Series and Prediction week-module-2	2	556	July 3, 2022
C4W3 Analyzing Learning Rate Using LearningRateScheduler Sequences, Time Series and Prediction week-module-3	1	11	September 25, 2024

Lecture comment on choosing optimal learning rate

Related topics