Week 2 Lab 3 Question About Feature Scaling

Wesley_Liu · January 8, 2024, 5:03pm

I have a conceptual question about gradients in relation to feature scaling, related to the image below:

As I learned in my multivariable calc class at college, gradients will always be perpendicular to level set (contours). I was wondering how it would be possible for the first part of the image to then oscillate between contours, since the gradient should always be pointing perpendicular? How would that be attributed to a lack of feature scaling (assuming that the learning rate is not too large)?

TMosh · January 8, 2024, 7:48pm

That figure is just a sketch.

Andrew is drawing the worst-case situation where the learning rate is too high, and the changes in the weights causes the cost to overshoots the best trajectory.

He could not draw all those arrows perpendicular to the curves, because they would all be right on top of each other, and the sketch would be unreadable.

Topic		Replies	Views
About gradient descent and Features scaling Supervised ML: Regression and Classification week-module-2	6	615	August 19, 2022
Feature Scaling Part 1: optimizing number of elements in dw Supervised ML: Regression and Classification week-module-2	1	26	November 24, 2024
Graph in optional lab : feature scaling and learning rate Supervised ML: Regression and Classification week-module-2	2	541	March 3, 2023
Feature Scaling: Why not use separate learning rates instead of rescaling features? Supervised ML: Regression and Classification week-module-2	1	400	August 5, 2023
Is my understanding of Feature Scaling correct? Supervised ML: Regression and Classification week-module-2	3	547	August 12, 2022

Week 2 Lab 3 Question About Feature Scaling

Related topics