In addition to @gent.spah's great answer:
- Side note: in linear regression the optimum can be calculated analytically with the normal equation, which works particularly well if the number of features is not too large and the data set is not too big. Otherwise (very many features plus really big data), gradient descent can be superior due to its iterative optimization approach: no matrix inversion step (cubic complexity) is needed, in contrast to the analytical solution via the normal equation (see the sketch after this list).
- In general: for very complex optimization problems you cannot just plot the cost, on the one hand because it is usually multi-dimensional (as @gent.spah correctly stated), but also because the cost is often not easy or cheap to compute. With gradient descent we literally take the next step in the direction in which we expect the cost to decrease (the negative gradient) and then carefully check again, over and over…
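For intuition, here is a minimal, self-contained sketch (plain NumPy on synthetic data, with an assumed learning rate and iteration count, not code from the course) that fits the same linear model once with the normal equation and once with gradient descent:

```python
import numpy as np

# Synthetic data with a known linear relationship (assumed example setup).
rng = np.random.default_rng(0)
m, n = 200, 3                                   # m examples, n features
X = rng.normal(size=(m, n))
true_w, true_b = np.array([2.0, -1.0, 0.5]), 4.0
y = X @ true_w + true_b + 0.1 * rng.normal(size=m)

# 1) Normal equation: closed-form solution; solving the (n+1)x(n+1) system
#    scales roughly cubically with the number of features.
Xb = np.hstack([X, np.ones((m, 1))])            # append a column of ones for the bias
theta = np.linalg.solve(Xb.T @ Xb, Xb.T @ y)    # solves (X^T X) theta = X^T y
print("normal equation:", theta)

# 2) Gradient descent: iterative, no matrix inversion; take a step along the
#    negative gradient of the mean squared error cost and re-evaluate, over and over.
w, b = np.zeros(n), 0.0
alpha = 0.1                                     # learning rate (assumed value)
for _ in range(1000):
    err = X @ w + b - y                         # prediction error
    w -= alpha * (X.T @ err) / m                # gradient of the cost w.r.t. w
    b -= alpha * err.mean()                     # gradient of the cost w.r.t. b
print("gradient descent:", w, b)
```

With only a handful of features both approaches recover essentially the same parameters; with very many features the matrix to invert (or solve) grows quickly, which is exactly where the iterative updates of gradient descent pay off.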
These threads might also be interesting for you:
- Gradient Descent for multiple feature linear regression - #9 by AbdElRhaman_Fakhry
- Supervised learning - #7 by Christian_Simonis
Hope that helps!
Best regards
Christian