I’m working through the Optional lab where we use PolynomialFeatures to “alter the model” for a better fit. It seems we are just simulating a model alteration by altering the training data (adding the x^2 feature). When I started this I thought there would be some way to directly alter the model equation, but I have not run across that. Is the lab approach the way it’s done in practice?
Thanks @TMosh. It seems strange to me, but it sounds like you know your stuff.
If you have a moment - if this is the industry approach, then the “model representation” is not actually encoded anywhere in the regression part of the solution. Does that imply that at prediction time, the transformation to add these feature values occurs upstream? And if it changes, we need to think about deployment of that little chunk of transformation code from an MLOps perspective?
Yes. This can be tricky, because your prediction code has to know how to create the new features. So that’s best encapsulated in a support function, which is used in both training and predictions.
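To make that concrete, here’s a minimal sketch of the pattern described above. The function name `add_engineered_features` and the toy data are my own invention, not from the lab; the point is that one shared function defines the engineered features, and both the training code and the prediction code call it:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Support function: the single place where the engineered features
# are defined, used by BOTH training and prediction code.
def add_engineered_features(X):
    """Append an x^2 column to the raw feature matrix."""
    return np.c_[X, X**2]

# --- training ---
X_train = np.arange(1, 11, dtype=float).reshape(-1, 1)
y_train = 3.0 * X_train[:, 0]**2 + 2.0   # toy quadratic target
model = LinearRegression()
model.fit(add_engineered_features(X_train), y_train)

# --- prediction: the SAME transformation runs "upstream" of the model ---
X_new = np.array([[12.0]])
y_pred = model.predict(add_engineered_features(X_new))
print(y_pred)  # close to 3 * 144 + 2 = 434
```

scikit-learn’s `Pipeline` (e.g. `make_pipeline(PolynomialFeatures(2), LinearRegression())`) is another way to bundle the transformation with the model, so you deploy one serialized object instead of a separate chunk of feature code.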
Note that this is only a useful technique in limited circumstances. It’s part of the standard “intro to machine learning” curriculum.
For example, a plain linear model can’t learn the equation for the distance traveled by a falling ball if your data is just elapsed time and distance, because distance depends on the square of the time (d = ½gt²). So you’d need to add a t^2 feature to get a good model.
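A quick sketch of the falling-ball case (the variable names and the noise-free data are assumptions for illustration). A linear fit on t alone underfits; adding the t^2 column lets linear regression recover g/2 exactly:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

g = 9.81
t = np.linspace(0.1, 5.0, 50).reshape(-1, 1)
d = 0.5 * g * t[:, 0]**2   # distance fallen at each elapsed time

# Linear model on t alone: cannot capture the quadratic relationship.
linear = LinearRegression().fit(t, d)

# Add a t^2 feature: the relationship becomes linear in the features.
t_feats = np.c_[t, t**2]
quadratic = LinearRegression().fit(t_feats, d)

print(linear.score(t, d))            # noticeably below 1.0
print(quadratic.score(t_feats, d))   # essentially 1.0
print(quadratic.coef_[1])            # recovers g/2 = 4.905
```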
But if you’re using a model with built-in non-linear functions (like a Neural Network), then you don’t need to create new features by hand. The hidden layers learn the equivalent non-linear combinations automatically, because a NN applies a non-linear activation function in its hidden layers.
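For completeness, a hedged sketch of that last point using the same falling-ball data: a small `MLPRegressor` with a tanh hidden layer fits the quadratic curve without any hand-made t^2 feature. The network size, solver, and scaling choices here are my assumptions, not anything prescribed by the lab:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.preprocessing import StandardScaler

t = np.linspace(0.1, 5.0, 200).reshape(-1, 1)
d = 0.5 * 9.81 * t[:, 0]**2   # quadratic target, no t^2 feature given

# Scale inputs and targets so the small network trains reliably.
x = StandardScaler().fit_transform(t)
y = (d - d.mean()) / d.std()

# No engineered features: the tanh hidden layer supplies the
# non-linearity needed to approximate the quadratic curve.
nn = MLPRegressor(hidden_layer_sizes=(32,), activation="tanh",
                  solver="lbfgs", max_iter=2000, random_state=0)
nn.fit(x, y)
print(nn.score(x, y))  # typically very close to 1.0
```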