Saw this concept in the Week 2 Assignment, but could not fully understand it.

What that means is that there is a "geometric" interpretation of what Logistic Regression is doing: it is finding a hyperplane in the input space that divides the "yes" answers from the "no" answers. We are solving for the coefficients and bias of a linear transformation that gives the minimal cost. Once we have *w* and *b*, the hyperplane is defined by

w^T x + b = 0

Of course there is no guarantee that such a hyperplane exists that perfectly divides the samples in that way: it all depends on what the data actually is. That's why Logistic Regression gives us only 70% accuracy on the image recognition task in Week 2. Neural Networks will allow us to represent much more complicated decision boundaries and will thus give better results on this type of task in general.
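To make the geometric picture concrete, here is a small sketch in numpy. The weights and bias are made up for illustration (not from the actual Week 2 model); the point is that the sigmoid output crossing 0.5 is exactly the same thing as checking which side of the hyperplane w^T x + b = 0 the point falls on:

```python
import numpy as np

# Hypothetical trained parameters, just for illustration.
w = np.array([2.0, -1.0])
b = 0.5

def predict(x):
    """Classify a point by which side of the hyperplane w^T x + b = 0 it lies on."""
    z = np.dot(w, x) + b
    a = 1.0 / (1.0 + np.exp(-z))   # sigmoid: probability of the "yes" class
    return 1 if a >= 0.5 else 0    # a >= 0.5 exactly when z >= 0

print(predict(np.array([1.0, 1.0])))    # z = 2 - 1 + 0.5 = 1.5 > 0, so "yes"
print(predict(np.array([-1.0, 1.0])))   # z = -2 - 1 + 0.5 = -2.5 < 0, so "no"
```

So thresholding the sigmoid at 0.5 and checking the sign of w^T x + b are the same decision rule.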

Wow, great to know. Thanks Paul.

Sometimes I just could not find definitions for these kinds of concepts and thus felt they were hard to understand.

Adding a little to what Paul said: even if there is no hyperplane that perfectly splits the samples, there may be one (or several) that helps: maybe one side is all one way, or there is a split where each side will be "close" to purely one way.

When we get to multi-layer networks it can help to think of the first layer as being a variety of such helpful splits that are giving information, and the next layer as combining the first-level distinctions.

Adding to @paulinpaloalto's excellent answer: you can also get a graphical view of how Logistic Regression splits the space between "yes" and "no" answers using the TensorFlow Playground, by setting up a NN with just 1 neuron and letting it run for some epochs.

In the big square on the right, the blue and orange backgrounds represent the space split generated by the logistic regression.

You will notice that the split is far from perfect, i.e. there are a lot of orange points in the blue space, and some blue points in the orange space. This is a limitation of logistic regression.

TensorFlow Playground

Thanks for the remarks

It is nice to visualise some of these concepts and remarks. Thanks for the info.