A doubt on gradient descent

JJaassoonn · June 13, 2023, 6:36am

Dear Administrator,

Could you please guide me on this issue?

Training examples with normalized features can speed up gradient descent compared to unnormalized features.
Increasing the number of training examples can improve the accuracy of prediction of a learning model.

Question:
By increasing the number of training examples with normalized features, will the gradient descent be speeding up or slowing down when finding for the minimum point of cost function?

Thank you.

TMosh · June 13, 2023, 6:46am

Gradient descent is faster with normalized features because you can use a higher learning rate.

Having more examples always requires more computation, so that means more examples is slower.

moshood_olawale · June 13, 2023, 6:58am

Sure. However, blatantly making the choice of a higher learning rate results in a poor model as it fails to converge

JJaassoonn · June 13, 2023, 9:43am

Dear Mr Tom Mosher and Mr Moshood Olawale

Thank you so much for your information.

TMosh · June 13, 2023, 1:22pm

That happens if the learning rate is too high.

Topic		Replies	Views
C1_W2_Lab03_Feature_Scaling_and_Learning_Rate_Soln cost Supervised ML: Regression and Classification week-2	3	225	February 21, 2024
Optional Lab: Feature Engineering and Polynomial Regression (Feature scaling impact on Convergence) Supervised ML: Regression and Classification week-2	2	532	September 2, 2022
Optional Lab: Feature scaling and Learning Rate (Multi-variable) Supervised ML: Regression and Classification week-2	1	477	August 24, 2022
The relation between scaling and learning rate Supervised ML: Regression and Classification week-2	3	536	March 27, 2023
Can someone help explain mathematically why normalizing inputs could improve convergence speed in gradient descent? Improving Deep Neural Networks: Hyperparameter tun week-1 , coursera-platform	1	34	January 10, 2025

A doubt on gradient descent

Related topics