Explain threshold in logistic regression

Praveen_Titus_F · November 9, 2022, 5:06pm

After finding optimum parameters in logistic regression using gradient descent for which the cost function is low, why is the threshold 0.5, can’t it be any other value.
It would be nice if someone give an explanation on it.

TMosh · November 9, 2022, 5:32pm

The values for True and False are 1.0 and 0.0.

The threshold is exactly halfway between them.

Praveen_Titus_F · November 9, 2022, 5:37pm

Thank you @TMosh , if possible can get some reading regarding this threshold.

TMosh · November 9, 2022, 5:51pm

What sort of additional information are you looking for?

paulinpaloalto · November 9, 2022, 6:10pm

It’s not a deep or subtle point. The idea is that the output of sigmoid looks like a probability, right? So we treat it as the probability that the answer is “Yes” for a given sample. So if it is > 0.5, that’s a positive answer, else it is interpreted as the model predicting “No” for that sample.

Praveen_Titus_F · November 9, 2022, 6:13pm

Does 0.5 works all time or need to adjust the value based on accuracy. For instance take ROC curve for tuning threshold. Correct me if i m wrong, Thank you

paulinpaloalto · November 9, 2022, 6:14pm

Yes, 0.5 works all the time. If the accuracy is bad, then either you need more or better training data or Logistic Regression is not going to be good enough for your problem and you need to consider a real Neural Network (stay tuned for that).

The point is that Logistic Regression can only do “linear separations”: the decision boundary looks like a hyperplane that is expressed by:

w^T \cdot X + b = 0

Notice that sigmoid(0) = 0.5. So whether LR will do a good job or not depends on whether your actual data is linearly separable. Sometimes that is the case and sometimes it’s not …

Praveen_Titus_F · November 9, 2022, 6:27pm

Thank you @paulinpaloalto , so when z = 0 then sigmoid(z) = 0.5, thats the hyperplane that separates positives n negatives ie (1 and 0).

paulinpaloalto · November 9, 2022, 6:28pm

Yes, that is the point: the decision boundary is at 0.5 for the output of sigmoid.

Praveen_Titus_F · November 9, 2022, 6:29pm

Awesome, Thanks you!! @paulinpaloalto @TMosh

Nicolas · November 10, 2022, 9:55am

A little addition. I advise you to read about precision and recall
As explained, the 0.5 value can be interpreted as a probability of belonging to class 0 or 1. But what if you want to reduce the number of misclassified 0’s or 1’s ? Then you can tune this value so to have less misclassified on one side in exchange for more on the other.
This is important for instance in medicine, if you don’t want to miss a tumor. You would prefer to have more false alarms than missing one, so you may want to tune the threshold to 0.4 for instance if 1 is the tumor label.
Same thing for a model who would say hello at the door when there is someome. You don’t want it to activate every time there is a cat, so you may want to tune your threshold too

Topic		Replies	Views
Threshold -sigmoid function Supervised ML: Regression and Classification week-module-3	2	330	November 1, 2023
Gradient Descend Lab Intuition Supervised ML: Regression and Classification week-module-3	7	58	November 5, 2024
Logistic Regression with Only Negative Predictions Supervised ML: Regression and Classification week-module-3	1	506	August 28, 2022
Logistic regression 0.5 threshold Supervised ML: Regression and Classification week-module-3	2	66	October 15, 2025
Purpose of Bias? NLP with Classification and Vector Spaces week-module-1	11	658	December 1, 2022

Explain threshold in logistic regression

Related topics