Scikit-learn regression: comparison of X_train vs normalized X

Hi,
Why do we see a difference in the output of the SGDRegressor class, and more iterations, with X_norm compared to X_train?

Code from Lab:
sgdr = SGDRegressor()
sgdr.fit(X_norm, Y_train)
print(sgdr)
print(f"number of iterations completed: {sgdr.n_iter_}, number of weight updates: {sgdr.t_}")

SGDRegressor(alpha=0.0001, average=False, early_stopping=False, epsilon=0.1,
             eta0=0.01, fit_intercept=True, l1_ratio=0.15,
             learning_rate='invscaling', loss='squared_loss', max_iter=1000,
             n_iter_no_change=5, penalty='l2', power_t=0.25, random_state=None,
             shuffle=True, tol=0.001, validation_fraction=0.1, verbose=0,
             warm_start=False)
number of iterations completed: 146, number of weight updates: 14455.0

===========

Code modified to pass X_train instead of X_norm to sgdr:

sgdr = SGDRegressor()
sgdr.fit(X_train, Y_train)
print(f"number of iterations completed: {sgdr.n_iter_}, number of weight updates: {sgdr.t_}")
print(sgdr)

number of iterations completed: 42, number of weight updates: 4159
SGDRegressor()

You can usually get a better fit with normalized features. Normalization also lets you use a larger learning rate, so comparing the two runs directly can be difficult.
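If you want to see the effect side by side, here is a minimal sketch. It does not use the lab's actual X_train / Y_train; a small synthetic dataset with deliberately mismatched feature scales stands in for them, StandardScaler stands in for the lab's z-score normalization, and random_state is fixed only so the runs are repeatable (the lab leaves it unset, which is another source of run-to-run differences).

import numpy as np
from sklearn.linear_model import SGDRegressor
from sklearn.preprocessing import StandardScaler

# Small synthetic stand-in for the lab's data: three features with very
# different scales, and a linear target with a little noise.
rng = np.random.default_rng(0)
X_train = rng.uniform(-1, 1, size=(200, 3)) * np.array([1.0, 4.0, 10.0])
Y_train = X_train @ np.array([2.0, -1.0, 0.5]) + rng.normal(0, 0.1, size=200)

# Z-score normalize each feature (same idea as the lab's X_norm).
X_norm = StandardScaler().fit_transform(X_train)

for name, X in (("X_train (raw)", X_train), ("X_norm (scaled)", X_norm)):
    sgdr = SGDRegressor(random_state=0)  # fixed seed only so reruns match
    sgdr.fit(X, Y_train)
    print(f"{name}: iterations={sgdr.n_iter_}, weight updates={sgdr.t_}")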

Do you have some metrics for the performance of both models?
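For example, continuing the sketch above (reusing its X_train, X_norm, and Y_train), training-set MSE and R^2 give one rough way to compare the two fits; both come from sklearn.metrics:

from sklearn.metrics import mean_squared_error, r2_score

for name, X in (("X_train (raw)", X_train), ("X_norm (scaled)", X_norm)):
    sgdr = SGDRegressor(random_state=0).fit(X, Y_train)
    pred = sgdr.predict(X)
    # Training-set error; a held-out split would be better, but the lab fits on all the data.
    print(f"{name}: MSE={mean_squared_error(Y_train, pred):.4f}, R^2={r2_score(Y_train, pred):.4f}")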

Yes, you could try passing X_train to the sgdr.fit call (line 7) of the notebook below…

sgdr = SGDRegressor()
sgdr.fit(X_norm, Y_train)
print(f"number of iterations completed: {sgdr.n_iter_}, number of weight updates: {sgdr.t_}")
print(sgdr)

Optional lab: Linear regression with scikit-learn | Coursera