What does the cost function of logistic regression look like?

paulinpaloalto · December 26, 2023, 1:13am

We use the standard “cross entropy” loss function for logistic regression and also for neural networks where the predictions are binary classifications (yes/no). The cost function is convex in the case of Logistic Regression, but it is not in the case of Neural Networks, because the cost function maps all the way from the input values to the final cost meaning that all the non-linear layers are included in the case of a Neural Network.

Here’s a thread which shows the graph of log(\hat{y}) which is the core of the cross entropy loss, if you just look at the function applied to the output, as opposed to the entire mapping from inputs to cost. You can clearly see that it is convex if you use the full -log(\hat{y}) function, which flips the graph shown about the x axis.

Here’s a thread which shows the graphs of the cross entropy cost surfaces versus an MSE cost function on a binary classifier. It’s not a mathematical proof, but a picture is worth a lot of words.

Here’s a thread which discusses more about the non-convexity of NN based classifier cost function.

Topic		Replies	Views
Why MSE is non-convex for Logistic regression Neural Networks and Deep Learning coursera-platform	3	768	December 26, 2021
Mse Cost function Neural Networks and Deep Learning coursera-platform	6	729	June 6, 2024
Concave Convex functions AI Discussions ai-discussions	5	137	June 16, 2024
Cost function convexity question Neural Networks and Deep Learning coursera-platform	1	352	September 11, 2023
Nonconvexity -logistic Supervised ML: Regression and Classification week-3	3	439	November 5, 2023

What does the cost function of logistic regression look like?

Related topics