Why can't we just set the derivative of the cost function to zero to find the local minima, instead of using the gradient descent algorithm?
It’s a natural question, but it turns out doing that doesn’t help: it just moves the difficulty elsewhere. Setting the derivative to zero generally gives you an equation that you can’t solve in “closed form”, so you then need yet another iterative approximation method to estimate the zeros of the derivative. For example, you could use the multi-dimensional analog of Newton-Raphson (which approximates the zeros of a function), but that would require computing the second partial derivatives of the cost, i.e. the Hessian. It’s simpler just to apply gradient descent or Conjugate Gradient methods to the cost function directly.
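To make this concrete, here is a one-dimensional toy sketch (the cost function is made up for illustration, not tied to any particular model). For the cost x² + eˣ, the stationarity equation 2x + eˣ = 0 is transcendental and has no closed-form solution, so you end up iterating either way: gradient descent uses only the first derivative, while Newton-Raphson on the gradient also needs the second derivative.

```python
import math

def cost(x):
    # Toy cost: its gradient equation 2x + e^x = 0 has no closed-form root.
    return x**2 + math.exp(x)

def grad(x):
    # First derivative of the cost.
    return 2 * x + math.exp(x)

def hess(x):
    # Second derivative, needed only by Newton-Raphson.
    return 2 + math.exp(x)

# Gradient descent: iterate using the first derivative alone.
x = 0.0
lr = 0.1
for _ in range(200):
    x -= lr * grad(x)

# Newton-Raphson applied to the gradient: finds a zero of grad,
# but each step requires the second derivative as well.
y = 0.0
for _ in range(20):
    y -= grad(y) / hess(y)

# Both converge to the same stationary point (approximately -0.3517),
# which no algebraic rearrangement could have given in closed form.
print(x, y)
```

Both loops land on the same minimizer; the point is that even after "setting the derivative to zero" you still need an iterative scheme, and the Newton-style one demands strictly more derivative information per step.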
Thanks! I appreciate the help.