Week 2 loss function

algekai · May 21, 2021, 12:48pm

Why would the squared error function give us multiple local minima?

Nicolas · May 23, 2021, 10:27am

This is a convexity problem. For instance, you can plot the function f(x) = (x*sin(x))**2, which has many local minima.

Applying gradient descent to such a function can be troublesome. It is still possible, but it would be cheating a little because here we know where the minimum is and there is only one variable, not thousands.
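To make that concrete, here is a minimal sketch of plain gradient descent on f(x) = (x*sin(x))**2. The starting points, learning rate, and step count are arbitrary choices for illustration, not values from the course. Each run settles in a different minimum depending on where it starts. For this particular f every minimum has value 0, so the sketch shows that the end point depends on the start, not that some minima are worse than others.

```python
import math

def f(x):
    return (x * math.sin(x)) ** 2

def grad_f(x):
    # d/dx (x*sin(x))**2 = 2 * (x*sin(x)) * (sin(x) + x*cos(x))
    return 2.0 * x * math.sin(x) * (math.sin(x) + x * math.cos(x))

def gradient_descent(x0, lr=0.001, steps=20000):
    # lr and steps are illustrative assumptions, small enough to avoid overshooting here
    x = x0
    for _ in range(steps):
        x -= lr * grad_f(x)
    return x

# Three arbitrary starting points, each lying in a different basin
starts = [2.5, 5.5, 9.0]
results = [gradient_descent(x0) for x0 in starts]

for x0, x_end in zip(starts, results):
    print(f"start={x0:.1f} -> x={x_end:.4f}, f(x)={f(x_end):.2e}")
```

With thousands of parameters you cannot pick the starting point by looking at a plot, which is why a non-convex cost is harder to trust.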

This was the motivation given in the course for using the logistic loss function instead, which leads to a convex minimization problem.
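As a rough numerical check of that claim, the sketch below compares the two losses for a single made-up training example (x = 1, y = 1) and a single weight w, with the prediction taken as sigmoid(w). It estimates the second derivative of each loss on a grid of w values using finite differences. The example, the grid, and the step h are assumptions chosen for illustration. A negative second derivative anywhere means the function is not convex. Note that this one-example case only shows that the convexity guarantee is lost for squared error; it does not by itself exhibit several minima.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def squared_error(w):
    # Squared error on the sigmoid output for the single example x=1, y=1
    return (sigmoid(w) - 1.0) ** 2

def logistic_loss(w):
    # Logistic (cross-entropy) loss for the same example: -log(sigmoid(w))
    return -math.log(sigmoid(w))

def second_derivative(fn, w, h=1e-3):
    # Central finite-difference estimate of fn''(w)
    return (fn(w + h) - 2.0 * fn(w) + fn(w - h)) / (h * h)

# Grid of weights from -6 to 6 in steps of 0.5
ws = [i * 0.5 for i in range(-12, 13)]
sq_curv = [second_derivative(squared_error, w) for w in ws]
log_curv = [second_derivative(logistic_loss, w) for w in ws]

print("min curvature, squared error:", min(sq_curv))
print("min curvature, logistic loss:", min(log_curv))
```

The squared error curvature goes negative for sufficiently negative w, while the logistic loss curvature stays positive across the grid.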
