Why do i get weird score after scaling data?

unnatiii · September 20, 2024, 6:22pm

hi im working on house price prediction

i have 300.000000 < total_sqft <30000.00000, and other two features have values between 1 to 16 ie for bath and bhk feature rest feature are one hot encoding(for location)…so applying feature scaling on total_sqft (StandardScaler())but then i get same score on training data(before and after scaling) but cross val data give score like this -3.521060163109078e+22

i get same results when i do this
x_train_scaled = x_train.copy()
x_train_scaled[‘total_sqft’] = x_train_scaled[‘total_sqft’] / x_train.total_sqft.max()
x_train_scaled[‘bath’] = x_train_scaled[‘bath’] / x_train.bath.max()
x_train_scaled[‘bhk’] = x_train_scaled[‘bath’] / x_train.bhk.max()

can anyone explain? what im doing wrong what i misunderstood

rmwkwok · September 21, 2024, 12:32am

Hello, @unnatiii, one thing that I spot is, you have not scaled your x_cv accordingly. You can imagine what happens if your trained model received some non-scaled feature values.

Raymond

Topic		Replies	Views
Feature Scaling: Why don't we feature scale the training, cross validation and test data seperately? Advanced Learning Algorithms week-3	4	24	September 17, 2024
Week 2, C1_W2_Lab04_FeatEng_PolyReg_Soln, "Scaling Features" example - z-score scaling intuition Supervised ML: Regression and Classification week-2	2	18	November 6, 2024
Question: Why we don't use standardization in feature scaling? Supervised ML: Regression and Classification week-3	3	487	July 24, 2022
Translating scaled features results to original values Supervised ML: Regression and Classification week-2	3	477	January 17, 2023
How to implement the feature scaling in prediction? Supervised ML: Regression and Classification week-2	1	524	June 23, 2022

Why do i get weird score after scaling data?

Related topics