Intuition for Log Loss Function

In the topic "Classification with Perceptron - Gradient Descent", the log loss is given as:

$$-y\,\ln(\hat{y}) \;-\; (1-y)\,\ln(1-\hat{y})$$

How did we arrive at this formula?

By design. It's a function that rewards correct predictions with a loss near zero and penalizes confident wrong predictions with a loss that grows without bound: when $y = 1$ the loss is just $-\ln(\hat{y})$, which goes to 0 as $\hat{y} \to 1$ and blows up as $\hat{y} \to 0$ (and symmetrically for $y = 0$).
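As a rough numerical illustration (my own minimal sketch in NumPy, not code from the course), you can watch the loss explode as a prediction for a positive example gets more confidently wrong:

```python
import numpy as np

def log_loss(y, y_hat, eps=1e-15):
    """Binary cross-entropy for a single example.
    eps keeps y_hat away from 0 and 1 so log() never sees exactly 0."""
    y_hat = np.clip(y_hat, eps, 1 - eps)
    return -y * np.log(y_hat) - (1 - y) * np.log(1 - y_hat)

# True label is 1: a confident correct prediction costs almost nothing,
# a confident wrong prediction is penalized heavily.
for y_hat in [0.99, 0.9, 0.5, 0.1, 0.01]:
    print(f"y=1, y_hat={y_hat:.2f} -> loss={log_loss(1, y_hat):.3f}")
```

The printed losses climb from about 0.01 (for $\hat{y}=0.99$) up to about 4.6 (for $\hat{y}=0.01$), which is exactly the "greatly penalizes incorrect predictions" behavior.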

It also has the nice property of combining cleanly with the derivative of the sigmoid() function when you compute the gradients.
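To make that concrete (this is the standard textbook derivation, not quoted from the course): if $\hat{y} = \sigma(z)$, so that $\sigma'(z) = \sigma(z)\,(1-\sigma(z))$, the chain rule collapses the gradient of the log loss into a very simple form:

```latex
% With \hat{y} = \sigma(z) and \sigma'(z) = \sigma(z)\,(1 - \sigma(z)):
\frac{\partial L}{\partial z}
  = \left(-\frac{y}{\hat{y}} + \frac{1-y}{1-\hat{y}}\right)\hat{y}\,(1-\hat{y})
  = -y\,(1-\hat{y}) + (1-y)\,\hat{y}
  = \hat{y} - y
```

That $\hat{y} - y$ term is why the gradient descent update for logistic regression looks so much like the one for linear regression.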

Here’s a thread about loss functions and the “cross entropy” loss function that you are asking about. That thread is from DLS, which is a more advanced series that you may want to take after MLS, so the discussion may mention things that Professor Andrew has not yet mentioned here. But at least it’s worth looking at the graphs of the natural log function between 0 and 1. As the old saying goes, sometimes a picture is worth a thousand words. 😄
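If you'd rather generate that picture yourself, here is a quick matplotlib sketch (my own, not from that thread) of the two branches of the loss, $-\ln(\hat{y})$ for $y=1$ and $-\ln(1-\hat{y})$ for $y=0$, over $(0, 1)$:

```python
import numpy as np
import matplotlib.pyplot as plt

y_hat = np.linspace(0.001, 0.999, 500)

# Loss when the true label is 1 vs. when it is 0.
plt.plot(y_hat, -np.log(y_hat), label=r"$-\ln(\hat{y})$  (y = 1)")
plt.plot(y_hat, -np.log(1 - y_hat), label=r"$-\ln(1-\hat{y})$  (y = 0)")
plt.xlabel(r"predicted probability $\hat{y}$")
plt.ylabel("loss")
plt.legend()
plt.show()
```

Each curve is near zero at the correct end of the interval and shoots toward infinity at the wrong end, which is the whole intuition in one picture.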
