SKLearn Logistic Regression outputs VERY different weights than manual Gradient Descent implementation

C1_W3_Lab06_Gradient_Descent_Soln
and
C1_W3_Lab07_Scikit_Learn_Soln

use the same X_train and y_train set.

But for the SKLearn version, if we output the weights with
print("weights:", lr_model.coef_)

we will see that the weights are very different from those of the manual implementation.
Lab06 output:
Iteration 0: Cost 0.684610468560574
Iteration 1000: Cost 0.1590977666870456
Iteration 2000: Cost 0.08460064176930081
Iteration 3000: Cost 0.05705327279402531
Iteration 4000: Cost 0.042907594216820076
Iteration 5000: Cost 0.034338477298845684
Iteration 6000: Cost 0.028603798022120097
Iteration 7000: Cost 0.024501569608793
Iteration 8000: Cost 0.02142370332569295
Iteration 9000: Cost 0.019030137124109114

updated parameters: w:[5.28 5.08], b:-14.222409982019837

 Lab07 output for the same data
 weights: [[0.90411349 0.73587543]]  

Do we know why? Or does SKLearn already do some feature scaling internally?
Also, how do we output the 'b' from the SKLearn result?

Thanks!

Hi Lizhang,

I don't think we can explain it fully without diving into the implementations, but a few notes:

  1. sklearn enables regularization for logistic regression by default, so you need to turn it off yourself; the lab doesn't use regularization.
  2. sklearn uses tol as one of its criteria for stopping training; the lab doesn't have that, only a fixed number of iterations.
  3. sklearn offers a few solvers, and if you try them one by one, each gives a different set of weights; that's a hint that their implementations can differ from ours (see the sketch just after this list).
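
To see how much those three settings matter, here is a minimal sketch. The X_train/y_train values are re-typed from the lab, so treat them as illustrative; and even with these settings the numbers won't match the lab digit for digit, since the optimizer and stopping point still differ.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    # The small six-point training set used in the labs (assumed values).
    X_train = np.array([[0.5, 1.5], [1, 1], [1.5, 0.5], [3, 0.5], [2, 2], [1, 2.5]])
    y_train = np.array([0, 0, 0, 1, 1, 1])

    # Point 1: turn regularization off (penalty=None needs a recent scikit-learn;
    #          older versions use penalty='none' or a very large C, e.g. C=1e10).
    # Point 2: tighten the stopping tolerance and raise the iteration cap.
    # Point 3: pick the solver explicitly so you know which one you are comparing.
    lr_model = LogisticRegression(penalty=None, solver='lbfgs', tol=1e-10, max_iter=100_000)
    lr_model.fit(X_train, y_train)

    print("weights:", lr_model.coef_)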

You can find the regularization settings, the choice of solvers, the stopping criteria, and how to print b (which sklearn calls the intercept) in the sklearn documentation.
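
Concretely, once lr_model has been fitted (as in Lab07 or the sketch above), b is stored on the estimator as intercept_:

    # b is the intercept_ attribute of the fitted model (a one-element array here)
    print("b (intercept):", lr_model.intercept_)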

Raymond

Lizhang, if you want to investigate this further, please make sure the weights have converged before comparing them.
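
On the sklearn side, one way to check this, as a sketch using the same assumed data as above: give the solver a generous iteration budget and a tight tolerance, then look at n_iter_; if it is well below max_iter, the solver stopped because it converged, not because it ran out of iterations.

    from sklearn.linear_model import LogisticRegression

    # No regularization (to match the lab), tight tolerance, generous iteration cap.
    lr_model = LogisticRegression(penalty=None, tol=1e-10, max_iter=1_000_000)
    lr_model.fit(X_train, y_train)   # same X_train / y_train as in the earlier sketch

    print("iterations used:", lr_model.n_iter_)   # far below max_iter => converged
    print("weights:", lr_model.coef_, "b:", lr_model.intercept_)

On the lab side, the analogous check is to keep running gradient descent until w and b stop changing between checkpoints.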

Very helpful. Thanks for the detailed reply!