Week 3 Uniform Sampling

bgoyal · May 30, 2021, 6:33am

I don’t understand why we do uniform sampling on a logarithmic scale instead of the standard scale. Is there any mathematical basis for this? Thanks in advance!

nramon · May 30, 2021, 2:38pm

Hi, @bgoyal.

As explained in this lecture, computing the exponentially weighted average is approximately equivalent to taking the average of the last 1/(1 - Beta) days.

As you can see, 1/(1 - Beta) is very sensitive to small changes in Beta when Beta is close to 1:

beta

If you sample uniformly, more often than not you’ll end up exploring a small subset of the range of 1/(1 - Beta):

uniform

Instead, you want to sample more densely (the formulas were removed, but I hope the point is clear) in the regions where Beta is closer to 1:

log_uniform

Let me know if that helped

amitp · July 12, 2021, 7:03am

Thank you for you explanation. This is helpful. I wonder what is the incentive of having to take samples from denser region over narrower region? If we choose beta closer to 1 does that mean we can do weighted average on many previous values.

Topic		Replies	Views
Appropriate scale to pick hyperparameter week 3 Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	571	August 8, 2021
How to sample hyperparameters? (DOUBT) Improving Deep Neural Networks: Hyperparameter tun coursera-platform	4	586	May 4, 2021
Exponentially Weighted Average Understanding Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	634	June 1, 2021
Exponentially weighted Average Improving Deep Neural Networks: Hyperparameter tun coursera-platform	7	742	May 17, 2021
What does this sentence mean? Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	556	July 10, 2021

Week 3 Uniform Sampling

Related topics