What is decision boundary doing here?

staticgeek · July 21, 2024, 7:53pm

I don’t understand the difference between the- 0.5 threshold border and the orange line which is the decision boundary… isn’t the decision boundary supposed to divide the positive and negative regions, im really confused, then why is the vertical line doing its job here, it’s clear that the vertical line is seperating the two regions, why is that, then what is the importance of the orange line i.e. the decision boundary…

pastorsoto · July 21, 2024, 8:29pm

Hi @staticgeek the orange line is a linear function using regression model, while the blue line shows the sigmoid function, both are able to make predictions but in different way, the model classify as bening or malignant depending on the size of the tumor, the vertical line is using the sigmoid function to classify the tumor. Overall the grahs shows you the difference between using logistic regression and linear regression for binary classification

I hope this helps

rmwkwok · July 22, 2024, 3:22am

Hello, @staticgeek,

Only the vertical line is the decision boundary. The orange line is not.

We do not need the orange line to show the decision boundary. I would just ignore the orange line for the purpose of identifying decision boundary.

Cheers,
Raymond

staticgeek · July 22, 2024, 10:18am

ok thanks

staticgeek · July 22, 2024, 10:19am

thanks

TMosh · July 22, 2024, 3:03pm

I think the decision boundary in this figure is drawn wrong. It should be a horizontal line at 0.5 units.

The orange line is not the boundary, it is just the linear plot of z.

Wendy · July 22, 2024, 11:18pm

As @TMosh says, the orange line is not the decision boundary, but just a plot of z.

To clarify why the decision boundary is vertical in the figure and not horizontal:
This is a simplified example that is trying to estimate whether a tumor is benign or malignant based solely on tumor size (the x axis). The y axis represents the categorical result (1 = malignant, 0 = benign). The vertical decision boundary is showing us that (as far as we can tell from only tumor size), tumors greater than the corresponding tumor size are likely to be malignant and those smaller than that size are likely to be benign.

rmwkwok · July 23, 2024, 3:57am

If we want to connect everything (including the orange line) up, here is one way:

The boundary is \text{probability} = y = 0.5, for that we need z = 0, and in order to draw the boundary (with respect to tumor size, as Wendy explained), we need to solve z = 0.83x + (-2.21) = 0, which is actually looking for the x-intercept. Since it is the x-intercept, we see it is where the orange line and the vertical line cross each other.

Cheers,
Raymond

Topic		Replies	Views
How to interpret z (the orange line) in logistic regression? Supervised ML: Regression and Classification week-3	14	467	December 4, 2023
Sigmoid function & Decision Boundary Supervised ML: Regression and Classification week-3	8	382	August 25, 2023
Decision Boundary - What is the utility Supervised ML: Regression and Classification week-3	2	132	May 18, 2024
Decision Boundary Single feature Supervised ML: Regression and Classification week-3	9	378	November 1, 2023
[C1_W3_Logistic_regression] How does the plot_decision_boundary() does in details? Supervised ML: Regression and Classification week-3	3	528	August 20, 2022

What is decision boundary doing here?

Related topics