Difference between .transform and .fit_transform when feature scaling

In the Model Evaluation and Selection lab, the notebook author uses two different methods when feature scaling with scikit-learn’s StandardScaler(): .transform(x_train) and .fit_transform(x_train). What is the difference between them?

Additionally, when the notebook author uses the PolynomialFeatures() object, they also call .fit_transform(x_train). Does that method do the same thing in PolynomialFeatures() and StandardScaler(), or are they different?

Below, you can find clarifications for the .fit_transform() and .transform() methods; I am not sure about the other functions because I haven’t gone through this course.

  1. .fit_transform(x_train):
  • This method is a combination of two steps: .fit() and .transform().
  • .fit(x_train): This step computes the necessary statistics or parameters from the data (e.g., mean and standard deviation for scaling, or the unique categories for encoding). It essentially “learns” from the data.
  • .transform(x_train): After fitting, this step applies the transformation to the data using the learned parameters.
  • .fit_transform(x_train): By combining these two steps, it performs both fitting and transforming in one go, which is often more convenient and efficient when you want to transform the training data.
  2. .transform(x_train):
  • This method is used to apply a transformation to the data using the parameters that have already been learned with .fit().
  • It does not compute or learn anything new; it simply uses the existing parameters to transform the data.
  • You would typically use .transform() on new data (e.g., validation or test sets) after you have already fitted the transformer on the training data, as shown in the sketch below.
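
Here is a minimal sketch of that pattern with StandardScaler. The data and the variable names x_train / x_test are made up for illustration and are not taken from the lab notebook:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Hypothetical training and test data (not from the lab notebook)
x_train = np.array([[1.0, 200.0], [2.0, 300.0], [3.0, 400.0]])
x_test = np.array([[1.5, 250.0]])

scaler = StandardScaler()

# fit_transform: learns the per-feature mean and standard deviation
# from x_train, then scales x_train using those learned values.
x_train_scaled = scaler.fit_transform(x_train)

# transform: reuses the mean/std learned from x_train to scale x_test,
# so the test set is never used to "learn" the scaling parameters.
x_test_scaled = scaler.transform(x_test)

print(x_train_scaled)
print(x_test_scaled)
```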
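
As for the second question: PolynomialFeatures() exposes the same fit / transform / fit_transform interface as StandardScaler(), but what it “learns” during fit is different — the number of input features and which polynomial term combinations to generate, rather than means and standard deviations. A hedged sketch, again with assumed data and variable names:

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures

# Hypothetical data (not from the lab notebook)
x_train = np.array([[1.0], [2.0], [3.0]])
x_test = np.array([[4.0]])

poly = PolynomialFeatures(degree=2, include_bias=False)

# fit_transform: fit() records the input feature count and the
# polynomial combinations to generate, then transform() expands
# x_train into [x, x**2].
x_train_poly = poly.fit_transform(x_train)

# transform: applies the same expansion to new data.
x_test_poly = poly.transform(x_test)

print(x_train_poly)  # [[1. 1.], [2. 4.], [3. 9.]]
print(x_test_poly)   # [[ 4. 16.]]
```

In both cases the rule of thumb is the same: call .fit_transform() once on the training data, then call .transform() on any validation or test data so it is processed with the parameters learned from training.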