When creating a post, please add:
Week # must be added in the tags option of the post. 
Link to the classroom item you are referring to:
Video created by DeepLearning.AI, Stanford University for the course "Supervised Machine Learning: Regression and Classification". This week, you'll learn the other type of supervised learning, classification. You'll learn how to predict ...
Description (include relevant info but please do not post solution code or your entire notebook) 
 
Why is the cost function (the log term) not plugged in while calculating gradient descent for (w, b)?
             
            
               
               
Hello @shouryaangrish,
Can you be a little more clear in your question?
             
            
               
               
TMosh, May 3, 2024, 7:45am (#4)
              Gradient descent uses the gradients of the cost function. It doesn’t directly use the cost function itself - only the gradients.
The gradients are found from the equations for the partial derivative of the cost function.
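To make that concrete, here is a minimal sketch of one gradient-descent step for logistic regression, assuming NumPy and made-up variable names (illustrative only, not the course's notebook code). Only the partial derivatives dj_dw and dj_db appear in the update; the cost value itself would only be computed separately, for example to check that it keeps decreasing.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def compute_gradients(X, y, w, b):
    # Partial derivatives of the log-loss cost with respect to w and b
    m = X.shape[0]
    f = sigmoid(X @ w + b)       # model predictions f_wb(x)
    err = f - y                  # (f_wb(x) - y), the common factor in both gradients
    dj_dw = (X.T @ err) / m      # dJ/dw, one entry per feature
    dj_db = np.sum(err) / m      # dJ/db
    return dj_dw, dj_db

def gradient_descent_step(X, y, w, b, alpha):
    # One update: uses only the gradients, never the cost value itself
    dj_dw, dj_db = compute_gradients(X, y, w, b)
    return w - alpha * dj_dw, b - alpha * dj_db
```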
             
            
               
               
@shouryaangrish
I am sorry, I didn't fully understand your question.
Still, logistic regression does use gradient descent as its main optimization method to find the best model parameters. Log loss is the cost function used for logistic regression; because it is related to maximum likelihood estimation, it works especially well for classification problems.
Also, gradient descent is what drives the cost function as low as possible. By looking at the gradient of the log-loss function, we can find the direction and size of the changes that need to be made to the model parameters to reach the smallest error.
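For reference, the log-loss cost and the gradients obtained by differentiating it are (written out here roughly in the notation the course uses):

$$
J(\vec{w},b) = -\frac{1}{m}\sum_{i=1}^{m}\left[\, y^{(i)}\log f_{\vec{w},b}\!\left(\vec{x}^{(i)}\right) + \left(1-y^{(i)}\right)\log\!\left(1-f_{\vec{w},b}\!\left(\vec{x}^{(i)}\right)\right) \right]
$$

$$
\frac{\partial J}{\partial w_j} = \frac{1}{m}\sum_{i=1}^{m}\left(f_{\vec{w},b}\!\left(\vec{x}^{(i)}\right)-y^{(i)}\right)x_j^{(i)},
\qquad
\frac{\partial J}{\partial b} = \frac{1}{m}\sum_{i=1}^{m}\left(f_{\vec{w},b}\!\left(\vec{x}^{(i)}\right)-y^{(i)}\right)
$$

where $f_{\vec{w},b}(\vec{x}) = \frac{1}{1+e^{-(\vec{w}\cdot\vec{x}+b)}}$ is the sigmoid model, and gradient descent repeatedly applies $w_j := w_j - \alpha\,\partial J/\partial w_j$ and $b := b - \alpha\,\partial J/\partial b$.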
             
            
               
               
Sorry, I missed your replies.
Yeah, I think I got it; sorry for the basic question.
So when the cost function is calculated for logistic regression as
J(w,b) = (1/m) * sum( Loss(f_wb(x), y) ),
it is not put in as the loss term for gradient descent for logistic regression, which is calculated as
w = w - alpha * d/dw( J(w,b) ).
This J(w,b) is not the same as the J(w,b) above.
Rather, it just gets the squared-error cost function, 1/(2m) * sum( (f_wb(x) - y)^2 ),
instead of the loss for logistic regression,
-(1/m) * sum( y*log(f_wb(x)) + (1 - y)*log(1 - f_wb(x)) ).
            
So, exactly my point: the cost function for logistic regression is
-(1/m) * sum( y*log(f_wb(x)) + (1 - y)*log(1 - f_wb(x)) ),
whereas when we do the gradient we take it as (f_wb(x) - y)^2, basically the squared-error cost function.
The normal cost function and the cost function used for gradient descent are different.
              
TMosh, May 8, 2024, 7:54am (#9)
              You are correct in that the linear regression and logistic regression cost functions are different.
When you compute the partial derivatives (i.e. the gradients), they are also different.
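A sketch of why the two updates can nonetheless look alike (standard derivations, not quoted from the course materials): differentiating each cost function gives

$$
\text{linear regression, squared-error cost, } f_{\vec{w},b}(\vec{x})=\vec{w}\cdot\vec{x}+b:\qquad
\frac{\partial J}{\partial w_j}=\frac{1}{m}\sum_{i=1}^{m}\left(f_{\vec{w},b}\!\left(\vec{x}^{(i)}\right)-y^{(i)}\right)x_j^{(i)}
$$

$$
\text{logistic regression, log-loss cost, } f_{\vec{w},b}(\vec{x})=\frac{1}{1+e^{-(\vec{w}\cdot\vec{x}+b)}}:\qquad
\frac{\partial J}{\partial w_j}=\frac{1}{m}\sum_{i=1}^{m}\left(f_{\vec{w},b}\!\left(\vec{x}^{(i)}\right)-y^{(i)}\right)x_j^{(i)}
$$

The two expressions have the same form, but $f$ is different, so the gradients (and the cost functions they come from) are different. The $(f_{\vec{w},b}(\vec{x}) - y)\,x_j$ term is what the derivative of the log loss simplifies to; it is not the squared-error cost being substituted in.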
             
            
               
               
Yup, and the cost function for logistic regression is different from the cost function for gradient descent when we compute the partial derivatives.