C1_W1_Gradient-Descent

SantoshKumarDoodala · July 28, 2022, 5:51pm

as Prof. Andrew told about the local minimum and how it has the slope = 0 which makes it so that we are unable to achieve the global maximum, which in many cases results in not so good value of w and b.
So i am wondering what can we do get the global maximun?

TMosh · July 28, 2022, 6:04pm

There are two factors:

For most simple systems (like linear and logistic regression), the cost function is convex, so there are no local minima. So there is nothing to worry about.

When the cost function is not convex (such as for a neural network), you can train multiple times using different initial weight values, and use the one with the lowest cost.

Another strategy is to use a validation or test set, and accept any local minimum solution that gives “good enough” results.

SantoshKumarDoodala · July 28, 2022, 6:06pm

One more question :- in the 2nd picture , how to choose the correct path, because selecting a different point initially (first step) in path resulted in 2 different minimum value .

rmwkwok · July 28, 2022, 11:38pm

Hello @SantoshKumarDoodala, in practice, we do not have the information to choose a “correct path”, because we do not know in prior where the minima are, and if we had known them, we did not need to do gradient descent in the first place, because gradient descent is all about finding a minimum.

Although we can’t choose path, we can choose initialization, and we can choose our model architecture. Below are good ways of how we justify our choices.

Raymond

Topic		Replies	Views
Cost function - How can we make sure that we end up in the global minimum and not one of the local minima Supervised ML: Regression and Classification week-2	2	825	December 3, 2022
Gradient descent fails at local maximum initial values? Supervised ML: Regression and Classification week-1	2	554	June 26, 2022
Gradient Descent two local minima Supervised ML: Regression and Classification week-1	5	154	May 12, 2024
What if we get local maxima when we choose w, b in gradient descent algorithm Supervised ML: Regression and Classification week-1	4	528	January 20, 2025
Gradient Descent Local Minima Supervised ML: Regression and Classification week-1	15	746	December 31, 2022

C1_W1_Gradient-Descent

Related topics