C1W1_nb03: intuition of the line that separates positive and negative regions

In the third lab, titled "Visualizing tweets and Logistic Regression models", we plot a line to show the cutoff between the positive and negative regions. The gray line is where the dot product of theta and x equals 0, meaning the two vectors are perpendicular. What is the intuition behind this?

You are correct that the dot product of two orthogonal vectors is zero.

But that’s not what’s happening here.

The dot product is an efficient way of computing the linear combination of the weights and features of an example. If that value is positive (or zero), you have a “True” result. If it’s negative, you have a “False” result.
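
Here is a minimal sketch of that decision rule (the weights and features below are made up for illustration, not the lab's actual values):

```python
import numpy as np

# Hypothetical trained weights: [bias, weight_1, weight_2]
theta = np.array([0.5, -1.2, 2.0])

# Hypothetical feature vector for one example: [1 (bias term), feature_1, feature_2]
x = np.array([1.0, 0.8, 0.3])

# The dot product computes the linear combination of weights and features
z = np.dot(theta, x)

# Positive (or zero) -> "True"; negative -> "False"
prediction = z >= 0
print(z, prediction)  # prints z and True here, since z > 0
```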

Thanks for your reply! Could you please elaborate on why, when \theta \cdot x is positive, it’s a “True” result, and “False” otherwise? Is this specific to logistic regression?

Yes. For logistic regression, that is true by definition.

Remember that we take sigmoid(\theta \cdot x), and that value is interpreted as the probability of a “yes” answer. Note that sigmoid(0) = 0.5 and sigmoid is monotonic, so a positive input gives you a probability > 0.5 and a negative input gives you a probability < 0.5.
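
To make that concrete, here is a tiny sketch of the sigmoid and its behavior around 0 (the inputs are arbitrary example values):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

print(sigmoid(0.0))   # 0.5: exactly on the decision boundary
print(sigmoid(2.3))   # > 0.5: a positive theta . x means "yes"
print(sigmoid(-2.3))  # < 0.5: a negative theta . x means "no"
```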

As Tom says, that’s the definition of Logistic Regression. The other important point is that this doesn’t “just happen”: we train the function so that it learns the coefficients (the elements of \theta) that give the best possible match to the training data we are using. Of course there is no guarantee that your data is “linearly separable”, so Logistic Regression may not work well in all classification cases. In many cases we need a more complex decision boundary, which requires more expressive functions. One approach to that is to graduate to Neural Networks, which will be covered in a later NLP course.
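
Tying this back to the original question about the gray line: if we assume the bias is folded into \theta as \theta_0 and the features are x = [1, x_1, x_2], then the boundary is the set of points where \theta \cdot x = 0, and solving for x_2 gives the equation of the line. A sketch with made-up weights (not the lab’s actual code):

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical weights: [bias, weight on x1, weight on x2]
theta = np.array([0.5, -1.2, 2.0])

# On the boundary, theta0 + theta1*x1 + theta2*x2 = 0,
# so x2 = -(theta0 + theta1*x1) / theta2
x1 = np.linspace(-5.0, 5.0, 100)
x2 = -(theta[0] + theta[1] * x1) / theta[2]

plt.plot(x1, x2, color="gray")  # the separating line
plt.xlabel("$x_1$")
plt.ylabel("$x_2$")
plt.title("theta . x = 0: points on opposite sides get opposite labels")
plt.show()
```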

Thank you for the detailed explanation.

Thank you!
