In classic linear regression courses such as econometrics we use OLS and not Gradient Descent. I just wanted to ask why we use Gradient Descent and not OLS or another approach such as Lagrange multipliers, etc.
The problem is that OLS is not a general method: it works for linear regression, but not for general neural networks or for classification problems where the loss function is not based on Euclidean distance. For that matter, linear regression does have a closed-form solution, the Normal Equation, but its computational cost grows roughly cubically with the number of features (because it requires solving a linear system the size of the feature dimension), so with a sufficiently large number of parameters Gradient Descent can actually be more efficient.
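To make the comparison concrete, here is a minimal NumPy sketch (not from the course materials) that fits the same least-squares problem both ways. The toy data, learning rate, and iteration count are assumptions chosen purely for illustration:

```python
import numpy as np

# Hypothetical toy data: m examples, n features (values chosen for illustration)
rng = np.random.default_rng(0)
m, n = 100, 3
X = rng.normal(size=(m, n))
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=m)

# Normal Equation (closed form): w = (X^T X)^{-1} X^T y
# Dominated by solving an n x n system, roughly O(n^3) in the number of
# features, which is what becomes expensive when n is large.
w_closed = np.linalg.solve(X.T @ X, X.T @ y)

# Gradient Descent on the least-squares cost J(w) = (1/2m) * ||Xw - y||^2
# Each step costs O(m*n), so it scales better when n is very large.
w_gd = np.zeros(n)
lr = 0.1                  # learning rate (assumed value for this toy problem)
for _ in range(1000):
    grad = X.T @ (X @ w_gd - y) / m   # gradient of J with respect to w
    w_gd -= lr * grad

print(w_closed)  # both estimates should be close to true_w
print(w_gd)
```

On a tiny problem like this the two answers agree to several decimal places; the point is only that the closed-form route needs a matrix solve in the feature dimension, while Gradient Descent only ever needs matrix-vector products.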
Gradient Descent is a general method that applies in all cases. We are just getting started on Neural Networks here and there is much more to learn. They come in lots of different architectures and can be used to address lots of different types of problems.
I am not familiar with Lagrange Multipliers, but here’s a tutorial from Jason Brownlee’s website about how they might apply in ML contexts. But you can assume that Prof Ng, Yann LeCun, Geoff Hinton, and all their grad students over the years know a lot of mathematics, and there’s a reason why the field in general uses gradient-based methods like Gradient Descent (and related techniques such as Conjugate Gradient) to solve the optimization problems here. In other words, it’s not just that they haven’t thought of the other methods you mention.