Optional lab: Gradient descent with and without using scikit-learn

Why are the parameters obtained using scikit-learn different from those obtained using the gradient_descent function provided in the lab?


Hello @Cristhian_David_Pere

The parameter settings are different in the first place. Also, sklearn’s LogisticRegression isn’t using gradient descent. The best we can hope for is that, given consistent parameters, the two approaches produce very similar results, not results identical down to the last digit.



As you can see from the cost history, the gradient descent run has not yet converged: the cost is still decreasing at the final iteration, so it has not reached its minimum.

sklearn is using the lbfgs minimizer, not gradient descent. This is a more advanced optimizer that runs more efficiently than gradient descent and finds a converged solution in fewer iterations.
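To see the effect concretely, here is a hedged sketch (toy synthetic data and my own minimal `gradient_descent`, not the lab’s exact code): both approaches land near the same weights, but not digit for digit, and sklearn gets there with its lbfgs solver rather than gradient descent.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradient_descent(X, y, alpha=0.1, iters=10000):
    # plain batch gradient descent on the logistic-regression cost
    m, n = X.shape
    w = np.zeros(n)
    b = 0.0
    for _ in range(iters):
        p = sigmoid(X @ w + b)           # predicted probabilities
        err = p - y
        w -= alpha * (X.T @ err) / m     # gradient of cost w.r.t. w
        b -= alpha * err.mean()          # gradient of cost w.r.t. b
    return w, b

# toy, noisy (non-separable) data so a finite optimum exists
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = (X[:, 0] + 2 * X[:, 1] + rng.normal(size=200) > 0).astype(float)

w_gd, b_gd = gradient_descent(X, y)

# sklearn's default solver is lbfgs; a large C weakens regularization
# so the objective is close to the unregularized one used above
clf = LogisticRegression(solver="lbfgs", C=1e6, max_iter=1000).fit(X, y)
print("gradient descent:", w_gd, b_gd)
print("sklearn (lbfgs): ", clf.coef_[0], clf.intercept_[0])
```

Note the `C=1e6` assumption: sklearn always applies L2 regularization, so to compare against an unregularized gradient-descent run you have to make it nearly negligible.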

You can learn about the lbfgs minimizer here:


Is it safe to assume that sklearn.LogisticRegression is using something other than the sigmoid function to fit the training data?
If it is using a faster algorithm/solver to converge with the same setup, I would assume the weights/bias will be similar.
I also noticed LogisticRegression doesn’t ask for an input cost function, which leads me to believe it may be solving a different “setup”.

I am new to this, and the documentation on sklearn’s LogisticRegression isn’t that easy to digest. Therefore, any insight you can share is greatly appreciated.



The standard for logistic regression is a linear model with sigmoid() applied to the output. I’m certain that is what sklearn uses also.

The cost function is built-in, since that determines the gradients, which are also built-in.
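For reference, here is a minimal sketch of those built-in pieces: the standard binary cross-entropy cost and the gradients that follow from it (function names are my own for illustration, not sklearn internals).

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(X, y, w, b):
    # binary cross-entropy (log loss), the standard logistic-regression cost
    p = sigmoid(X @ w + b)
    return -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))

def gradients(X, y, w, b):
    # analytic gradients derived directly from the cost above
    err = sigmoid(X @ w + b) - y
    return X.T @ err / len(y), err.mean()

# tiny worked example
X = np.array([[1.0, 2.0], [3.0, -1.0], [0.5, 0.5]])
y = np.array([1.0, 0.0, 1.0])
w = np.array([0.1, -0.2])
b = 0.05
print(cost(X, y, w, b), gradients(X, y, w, b))
```

Because the gradient is fully determined by this cost, a library can hard-code both and only ask you for data and hyperparameters.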

You can pick among several different optimization methods. They should all give very close to the same weights, just with different tradeoffs between processor load, memory usage, and speed of convergence.
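A quick sketch of that claim (the solver names are real sklearn options; the data is synthetic): the smooth solvers all minimize the same regularized log loss, so their weights come out very close.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=500, n_features=4, random_state=0)

# fit the same model with several solvers; only the optimizer changes
for solver in ("lbfgs", "newton-cg", "liblinear"):
    clf = LogisticRegression(solver=solver, max_iter=1000).fit(X, y)
    print(solver, np.round(clf.coef_[0], 3), np.round(clf.intercept_, 3))
```

One caveat: liblinear regularizes the intercept slightly differently from the other solvers, so its numbers can drift a little more than lbfgs vs. newton-cg.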