Local minimum vs Global minimum in the context of Gradient Descent

paulinpaloalto · December 29, 2022, 12:36am

Yes, as Raymond says, finding the actual global minimum is probably not possible, but the higher level point he also makes is that is not what we really want in any case, since it would most likely represent extreme overfitting on the training set. Remember that what we really want is balanced performance on the cross validation and test data, which is not the same data as the training data. Of course we hope that it has a very similar statistical properties, but it is different. Here’s a thread from DLS from a while ago that discusses these issues in more detail.

Topic		Replies	Views
Gradient Descent two local minima Supervised ML: Regression and Classification week-1	5	154	May 12, 2024
Local Optima with Gradient Descent Improving Deep Neural Networks: Hyperparameter tun	1	552	May 30, 2021
C1_W1_Gradient-Descent Supervised ML: Regression and Classification week-1	3	570	July 28, 2022
Local optima in gradient descent Neural Networks and Deep Learning	2	639	March 13, 2022
Query regarding local and global minima Supervised ML: Regression and Classification week-1	2	530	July 1, 2022

Local minimum vs Global minimum in the context of Gradient Descent

Related topics