Always exciting to revisit these fundamental concepts as we look at things differently.

I was curious about the starting point for Gradient Descent when you are on top of the hill as explained in the video.
How do you begin taking that “baby step” with the fastest drop when you are starting on top of the hill and all immediate directions are equal?

I think that if all the immediate directions look equal, we are at a local maximum (or a saddle point), where the derivative is 0 and plain gradient descent gets stuck. In practice, though, when we initialise the weights randomly there is a vanishingly small chance of landing exactly on such a point; most of the time we are not actually on the very top of the hill but somewhere on the slope.
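A tiny sketch of this in Python (using a made-up 1-D loss, not anything from the video): if you start exactly where the derivative is zero, gradient descent never moves, but a random start almost never lands on that point.

```python
import math
import random

# Hypothetical 1-D loss surface: cos(w) has a "hilltop" at w = 0
# and valleys (minima) at w = +/- pi.
def loss(w):
    return math.cos(w)

def grad(w):
    return -math.sin(w)

def gradient_descent(w0, lr=0.1, steps=1000):
    w = w0
    for _ in range(steps):
        w -= lr * grad(w)  # the "baby step" downhill
    return w

# Starting exactly on the hilltop: the gradient is 0, so we never move.
print(gradient_descent(0.0))  # stays at 0.0

# Starting from a random weight: almost surely not on the hilltop,
# so we slide down into one of the valleys near +/- pi.
random.seed(42)
print(gradient_descent(random.uniform(-3, 3)))
```

The second run converges to a minimum near ±π, which is why random initialisation makes the "stuck on the exact top" case a non-issue in practice.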

Even if training does get stuck in a local minimum, we can rerun it several times with different random initialisations, hoping that at least one of those runs reaches the global optimum.
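That restart strategy can be sketched like this, again with a made-up 1-D loss that has one shallow valley and one deeper (global) one: run gradient descent from several random starting points and keep the result with the lowest loss.

```python
import random

# Hypothetical loss with two valleys; the one near w = -1 is deeper,
# so it is the global minimum.
def loss(w):
    return (w * w - 1) ** 2 + 0.3 * w

def grad(w):
    return 4 * w * (w * w - 1) + 0.3

def descend(w0, lr=0.01, steps=2000):
    w = w0
    for _ in range(steps):
        w -= lr * grad(w)
    return w

# Several runs from random initial weights; keep the best outcome.
random.seed(0)
candidates = [descend(random.uniform(-2, 2)) for _ in range(10)]
best = min(candidates, key=loss)
print(best)  # lands near w = -1, the global minimum
```

Some restarts end up in the shallow valley near w = +1, but taking the minimum-loss result over all runs recovers the deeper one, which is exactly the "run it many times" idea.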