In one of the first lectures on Logistic Regression, Prof Andrew Ng says that the cross-entropy cost function is convex (unlike the squared-error cost).
Does that still hold once we get to multi-layer feedforward networks (with ReLU activations in the hidden layers) in Week 4? That is, does the cross-entropy cost remain convex with respect to all of the W^{[l]} and b^{[l]}?
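
To make the question concrete, here is a small numerical probe I put together (my own sketch, not from the course materials; the network shapes, the `loss` and `random_params` helpers, and the toy labels are all illustrative choices). Convexity would require L(t·p1 + (1−t)·p2) ≤ t·L(p1) + (1−t)·L(p2) for every pair of parameter settings p1, p2 and every t in [0, 1], so one can at least check that inequality along a random line through parameter space for a tiny one-hidden-layer ReLU network with a sigmoid output and binary cross-entropy:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(2, 100))                      # 2 features, 100 examples
Y = (X[0] * X[1] > 0).astype(float)[None, :]       # toy binary labels, shape (1, 100)

def loss(params):
    """Binary cross-entropy of a 1-hidden-layer ReLU network."""
    W1, b1, W2, b2 = params
    A1 = np.maximum(0, W1 @ X + b1)                # ReLU hidden layer
    A2 = 1 / (1 + np.exp(-(W2 @ A1 + b2)))         # sigmoid output
    eps = 1e-12                                    # avoid log(0)
    return -np.mean(Y * np.log(A2 + eps) + (1 - Y) * np.log(1 - A2 + eps))

def random_params():
    """One random setting of (W1, b1, W2, b2) for a 2-3-1 network."""
    return [rng.normal(size=(3, 2)), rng.normal(size=(3, 1)),
            rng.normal(size=(1, 3)), rng.normal(size=(1, 1))]

p1, p2 = random_params(), random_params()
for t in np.linspace(0, 1, 11):
    mix = [t * a + (1 - t) * b for a, b in zip(p1, p2)]
    lhs = loss(mix)                                # loss at the interpolated point
    rhs = t * loss(p1) + (1 - t) * loss(p2)        # chord between the endpoints
    print(f"t={t:.1f}  L(mix)={lhs:.4f}  chord={rhs:.4f}  convex_ok={lhs <= rhs + 1e-9}")
```

A single line that satisfies the inequality doesn't prove convexity, but any violation along any line would disprove it, so this kind of check at least shows how the claim could be tested empirically.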