In the video “Using an Appropriate Scale to pick Hyperparameters”, it says that if we are tuning the learning rate in the range (0.001, 1) in the iinear scale, 90% of the samples would be in (0.1, 1). I could’t get the mathematical justification for that.