Why is the sigmoid function's z term equal to "w*x+b" in logistic regression?

The sigmoid function is 1 / (1 + e^(-z)) where z = w*x+b. Why is the z term set to w*x+b? I understand why we use w*x+b in linear regression, since that is the equation for a line, but I do not understand why we are able to use it in the sigmoid. Any explanation is welcome!

The logistic prediction is a linear prediction passed through a sigmoid function, so that its output is limited to the range 0 to 1. This matches the binary (0 or 1) values that are the labels.
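A minimal sketch of that composition, assuming a NumPy setup and illustrative values for `w`, `b`, and `x` (none of these come from the course materials):

```python
import numpy as np

def sigmoid(z):
    # Squashes any real number into the open interval (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

def predict(x, w, b):
    # Linear prediction z = w*x + b, then passed through the sigmoid
    z = np.dot(w, x) + b
    return sigmoid(z)

# Illustrative weights and input; output is a probability-like value in (0, 1)
p = predict(np.array([2.0, 1.0]), np.array([0.5, -0.25]), 0.1)
print(p)
```

Whatever values z takes, the sigmoid maps it into (0, 1), which is what lets the result be read as an estimated probability of the positive class.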

It also turns out this is a good choice when you have to compute the partial derivatives of the cost (i.e., compute the gradients), so you can use gradient descent to find the weights that minimize the cost.
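To see why the pairing is convenient: with the sigmoid activation and the log-loss cost, the gradients reduce to the same simple "error times input" form as in linear regression. A hedged sketch (the vectorized form here is one common way to write it, not necessarily the course's exact notation):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradients(X, y, w, b):
    # Partial derivatives of the log-loss cost with respect to w and b.
    # X: (m, n) examples, y: (m,) binary labels, w: (n,) weights, b: scalar.
    m = X.shape[0]
    err = sigmoid(X @ w + b) - y      # per-example prediction error
    dw = (X.T @ err) / m              # dJ/dw
    db = np.mean(err)                 # dJ/db
    return dw, db
```

A gradient-descent loop then just repeats `w -= alpha * dw; b -= alpha * db` until the cost stops decreasing.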


If you’re looking for some intuition on why logistic regression works, I recommend watching (or rewatching) the video on decision boundaries:

The video covers how the “line” from the equation can be visualized as a decision boundary for logistic regression. Basically, the line defined by the w*x+b can be used to divide up the input into two output classes.
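One way to see the division concretely: the boundary is the set of points where w*x + b = 0, because there the sigmoid outputs exactly 0.5; on one side z > 0 (prediction above 0.5), on the other z < 0. A tiny sketch with made-up weights:

```python
import numpy as np

w = np.array([1.0, 1.0])   # illustrative weights
b = -3.0                   # boundary is the line x1 + x2 = 3

def classify(x):
    # Class is decided by which side of the line w*x + b = 0 the point is on
    return int(np.dot(w, x) + b > 0)

print(classify(np.array([1.0, 1.0])))  # z = -1 -> class 0
print(classify(np.array([2.0, 2.0])))  # z = +1 -> class 1
```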

This visualization doesn’t necessarily work for higher-dimensional or more complicated systems, but I think it helps with the fundamental intuition behind it.

Rewatching with this question in mind did help, thanks for that. I am more interested now in how anyone figured this out in the first place.

Clever people have been working on logistic regression since at least the 1950s.

Logistic regression is based on the original perceptron concept, which was a simplified simulation of the actions of a biological neuron.


That is awesome! I look forward to learning more of this stuff, including the long history.

The courses focus on concepts and implementation, but don’t discuss much of how we got here.

So logistic regression uses the linear regression function, to which the sigmoid function is then applied to get a prediction between 0 and 1?

Not only the decision boundary but logistic regression itself is then based on a linear function w*x+b, because ultimately logistic regression = sigmoid(linear regression). So can we say that logistic regression = function(linear regression)? The reason this is a little difficult to comprehend is that in logistic regression you intend to predict a classification, which is typically binary, whereas in linear regression you intend to predict a value that is non-binary.

You are correct about the forms of the activation functions.

Where the difference lies is in the cost functions that use the activations.

  • In linear regression, we’re trying to create a model that fits the data points.
  • In logistic regression, we’re trying to create a boundary that separates the data points into True and False regions.

These two tasks require completely different cost functions.
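A brief sketch of the two costs side by side (the function names here are my own, not from the course): squared error measures distance from the fitted values, while log loss heavily penalizes confident wrong classifications.

```python
import numpy as np

def mse_cost(y_hat, y):
    # Squared-error cost used for linear regression: fit the data points
    return np.mean((y_hat - y) ** 2)

def log_loss(y_hat, y, eps=1e-12):
    # Cross-entropy cost used for logistic regression: separate the classes.
    # Clipping avoids log(0) for predictions at exactly 0 or 1.
    y_hat = np.clip(y_hat, eps, 1 - eps)
    return -np.mean(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))
```

Log loss also keeps the logistic cost convex in the weights, which squared error would not, so gradient descent can find the global minimum.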
