Hey everyone
Classroom Item: Week 3, Cost function for logistic regression
In this video, Andrew said that the logistic regression function f_w,b(x) predicts a value between 0 and 1, and this value is then used to compute the cost with logarithms. For example, if y (the actual output) is 0 and the function outputs 0.5 or 0.7, the cost will be fairly high. But suppose the threshold is chosen so that we predict 0 if f_w,b(x) <= 0.7 and 1 otherwise. In that case I think the cost should be zero, since the algorithm predicted the actual output correctly regardless of whether it output 0.5 or 0.7.
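(If I'm reading the lecture's formula right, the per-example loss is -y*log(f_w,b(x)) - (1-y)*log(1-f_w,b(x)), so with y = 0 that gives -log(1 - 0.5) ≈ 0.69 for a prediction of 0.5 and -log(1 - 0.7) ≈ 1.20 for 0.7, even though both fall on the "0" side of my 0.7 threshold.)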
So why is it necessary to use the function's actual output (0.5 or 0.7) in the cost, rather than the binary decision (0 or 1) given by the threshold and decision boundary? From my understanding, f_w,b(x) should predict either 0 or 1 based on the chosen threshold and decision boundary.
Isn't the final binary output (0 or 1) what's actually used when making predictions, rather than the continuous value of the function?
Hi @Tera_Byte
The reason is that logistic regression aims to predict probabilities rather than hard binary outputs. This lets the model learn how confident it should be in each prediction. If we only used the final binary output in the cost, the model wouldn't learn the degree of error when it's unsure. Training on the probabilities is what pushes them closer to 0 or 1 as the model becomes more confident.
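Here's a minimal sketch of that difference in NumPy (the predictions and the threshold are just made up for illustration; the log-loss formula is the one from the lecture):

```python
import numpy as np

def log_loss(y, f):
    # Per-example logistic (cross-entropy) loss: -y*log(f) - (1-y)*log(1-f)
    return -(y * np.log(f) + (1 - y) * np.log(1 - f))

def zero_one_loss(y, f, threshold=0.5):
    # Hard loss on the thresholded prediction: 0 if it matches the label, else 1
    return float((f >= threshold) != y)

y = 0  # actual label
for f in [0.1, 0.4, 0.6, 0.9]:
    print(f"f = {f}: 0/1 loss = {zero_one_loss(y, f)}, log loss = {log_loss(y, f):.3f}")

# The 0/1 loss jumps between 0 and 1 and is flat everywhere else, so gradient
# descent has nothing to work with; the log loss grows smoothly as the
# predicted probability drifts away from the true label.
```

So even two predictions that land on the correct side of the threshold, like 0.1 and 0.4, are not treated the same: the more confident one gets the lower cost.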
Hope it helps! Feel free to ask if you need further assistance.
@Alireza_Saei thanks for answering. Your reply helped, but I feel like I'm still missing something. What do you mean by "the degree of error when it's unsure"? And do you maybe have an example in mind that could expand on your answer?
You’re very welcome @Tera_Byte !
I mean that logistic regression not only tries to predict the correct class but also considers how confident it is about that prediction.
For example, if the actual output is 0 and the model predicts 0.9 (strongly wrong) versus 0.4 (somewhat wrong), the cost function penalizes the 0.9 prediction more heavily because it’s more confident in an incorrect answer. This helps the model learn to be more precise, trying to push values closer to 0 or 1 based on its certainty.
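To put numbers on that (a quick sketch using the lecture's loss formula; 0.9 and 0.4 are just the illustrative predictions from my example):

```python
import numpy as np

y = 0  # actual output
for f in [0.4, 0.9]:
    cost = -(y * np.log(f) + (1 - y) * np.log(1 - f))  # per-example logistic loss
    print(f, round(cost, 2))
# 0.4 -> 0.51 (somewhat wrong), 0.9 -> 2.3 (confidently wrong, much larger cost)
```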
Let me know if this clarifies things a bit more!
Yeah, that answer did it for me. Thanks again @Alireza_Saei
You’re welcome! Happy to help.