When the “loss function” is introduced for “Logistic Regression”, Professor Ng starts by giving the formula for the algebraized (“one-linerized”) cross-entropy loss for the 2-class case, and then justifies why it looks like a good choice.
I would suggest doing it the reverse way:
- Introduce the idea of the cross-entropy loss for the 2-class case. In particular, name the concept “cross-entropy loss” so that students can look it up to learn more about it. At this point it is not yet a single formula but distinguishes the cases y = 1 and y = 0 explicitly (see the sketch after this list).
- In this form it is easy to see that the loss takes on values the way we would want from an adequate loss function: near 0 when the prediction matches the label, and large when the prediction confidently misses it.
- Set up the single algebraic loss formula using the usual trick of summing over the two cases: y * (value in case y = 1) + (1 - y) * (value in case y = 0). Exactly one of the two terms survives for any given label.
- And that’s it.
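For concreteness, here is how the two steps could be written out (a sketch; I am using ŷ for the model’s predicted probability of class 1, which is my notation, not necessarily the course’s):

```latex
% Step 1: case-by-case cross-entropy loss for a 2-class problem,
% with \hat{y} \in (0, 1) the predicted probability of class 1 (my notation):
\[
L(\hat{y}, y) =
\begin{cases}
  -\log(\hat{y})     & \text{if } y = 1, \\
  -\log(1 - \hat{y}) & \text{if } y = 0.
\end{cases}
\]
% Step 2: the summing trick collapses both cases into one line,
% since exactly one of the two terms is nonzero for y in {0, 1}:
\[
L(\hat{y}, y) = -\bigl( y \log(\hat{y}) + (1 - y)\,\log(1 - \hat{y}) \bigr)
\]
```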
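And a minimal Python sketch (function names are mine) checking that the case-by-case version and the one-liner agree, which is the whole point of the summing trick:

```python
import numpy as np

def cross_entropy_case_by_case(y_hat: float, y: int) -> float:
    """Two-case definition: pedagogically explicit."""
    if y == 1:
        return -np.log(y_hat)        # want y_hat close to 1
    else:
        return -np.log(1.0 - y_hat)  # want y_hat close to 0

def cross_entropy_one_liner(y_hat: float, y: int) -> float:
    """Single formula via the y * (...) + (1 - y) * (...) trick."""
    return -(y * np.log(y_hat) + (1 - y) * np.log(1.0 - y_hat))

# The two definitions agree for both labels and any prediction in (0, 1):
for y in (0, 1):
    for y_hat in (0.1, 0.5, 0.9):
        assert np.isclose(cross_entropy_case_by_case(y_hat, y),
                          cross_entropy_one_liner(y_hat, y))
```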