Why following hyparameter tuning function is incorrect?

sahil5710 · January 28, 2025, 11:51am

In Week 3 Quiz, there is following question

If you think β (hyper parameter for momentum) is between 0.9 and 0.99, which of the following is the recommended way to sample a value for beta?
I am confused in two options

beta= 1-10**(-r-1)
beta=0.09*r +0.9

The correct answer is 1st option but I am unable to understand what is wrong with 2nd option

Please help me to clarify this

rmwkwok · January 28, 2025, 1:22pm

Hello, @sahil5710,

Approach 1 uses log scale, and approach 2 linear scale. Andrew introduced both of them and recommended approach 1 in the C2 W3 lecture video titled “Using an Appropriate Scale to pick Hyperparameters”. Log scale tries values that are more different (in terms of the effect to the model training process) from one another than linear scale does. However, I would recommend you to go through that video again first.

Cheers,
Raymond

sahil5710 · January 29, 2025, 10:20am

Thank you! Will surely look into this

TMosh · January 29, 2025, 6:08pm

Example:
Say you wanted to explore the effect of a model parameter that perhaps varies between 1 and 100.

You could try values between 1 and 100, incrementing by 10 each time. That’s a linear progression, and would require 11 test runs.

Or you could try the values 1, 3, 10, 30, and 100 (a common approximation of log scaling), and explore the same range with 5 tests.

If necessary, you can then perform another narrower search.

sahil5710 · January 30, 2025, 3:42pm

Got it thanks

Topic		Replies	Views
Sampling hyperparameter for momentum Improving Deep Neural Networks: Hyperparameter tun quiz-help , week-module-3 , coursera-platform	8	294	February 2, 2025
Appropriate scale to pick hyperparameter week 3 Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	551	August 8, 2021
C2 W3 Quiz question Improving Deep Neural Networks: Hyperparameter tun week-module-3 , coursera-platform	2	31	November 3, 2024
How to sample hyperparameters? (DOUBT) Improving Deep Neural Networks: Hyperparameter tun coursera-platform	4	585	May 4, 2021
Log Scale for Hyperparamers Explanaion Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	497	June 22, 2022

Why following hyparameter tuning function is incorrect?

Related topics