Gradient descent local maximum

Xujie_Yuan · July 7, 2022, 1:28pm

If w happens to be initialized at a point where the cost function J has a local maximum, like the top of a mountain, the derivative term is zero there, so gradient descent never updates w and the algorithm makes no progress. How can this problem be solved, please?

rmwkwok · July 7, 2022, 1:41pm

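Hi @Xujie_Yuan, in that case the gradient is zero, so the parameters never change and training ends without any cost reduction. If you see this, please re-train the model with a new initialization of the parameters. We usually use random initialization for parameters, which makes landing exactly on such a point very unlikely.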
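
To make this concrete, here is a minimal sketch of both situations. It assumes a made-up one-parameter cost J(w) = w^4 - 2w^2 (not from the course), chosen only because it has a local maximum at w = 0 and minima at w = -1 and w = 1. The function names, learning rate, and step count are illustrative choices as well.

```python
import random

def cost(w):
    # Hypothetical non-convex cost: local maximum at w = 0, minima at w = -1 and w = 1
    return w**4 - 2 * w**2

def grad(w):
    # Derivative of the cost above: dJ/dw = 4w^3 - 4w
    return 4 * w**3 - 4 * w

def gradient_descent(w0, alpha=0.05, steps=200):
    # Plain gradient descent: w := w - alpha * dJ/dw
    w = w0
    for _ in range(steps):
        w = w - alpha * grad(w)
    return w

# Starting exactly on the local maximum: the gradient is 0, so w never moves
stuck = gradient_descent(0.0)

# Re-training from a random starting point (seeded here for repeatability)
rng = random.Random(0)
w_init = rng.uniform(-2.0, 2.0)
found = gradient_descent(w_init)

print("start at 0.0 ->", stuck, "cost", cost(stuck))
print("start at", w_init, "->", found, "cost", cost(found))
```

The first run stays at w = 0 with no cost reduction, while the randomly initialized run slides down to one of the two minima.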

Raymond

TMosh · July 7, 2022, 3:45pm

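For most simple regression systems, such as linear regression with the squared-error cost, the cost function is known to be convex (bowl-shaped). So there cannot be any local maxima, and the only point where the gradient is zero is the minimum.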
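
As a rough illustration of the convex case, the sketch below assumes the squared-error cost J(w) = (1/2m) Σ (w·x - y)^2 with the bias b left out for simplicity, and a tiny made-up dataset. Its second derivative with respect to w is (1/m) Σ x^2, which is positive whenever any x is nonzero, so the curve bends upward everywhere and has no hilltop to get stuck on.

```python
# Toy data, invented for illustration (y = 2x exactly)
x = [1.0, 2.0, 3.0]
y = [2.0, 4.0, 6.0]
m = len(x)

def dJ_dw(w):
    # Derivative of J(w) = (1/2m) * sum((w*x - y)^2) with respect to w
    return sum((w * xi - yi) * xi for xi, yi in zip(x, y)) / m

# Second derivative is constant in w: (1/m) * sum(x^2)
d2J_dw2 = sum(xi * xi for xi in x) / m

def run(w0, alpha=0.1, steps=500):
    # Plain gradient descent from starting point w0
    w = w0
    for _ in range(steps):
        w -= alpha * dJ_dw(w)
    return w

# Very different starting points all end at the same minimum
results = [run(w0) for w0 in (-100.0, 0.0, 50.0)]
print("second derivative:", d2J_dw2)
print("final w from each start:", results)
```

Every starting point ends up at the same w, which is what you would expect from a bowl-shaped cost.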
