Using correlation matrix for feature selection

You can describe the data with a sin(t), cos(t) relation.

After this transformation, the data becomes linearly separable.
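As a minimal sketch of that idea (the data and feature names here are assumptions, not from the original thread): a class label that repeats with the period of a raw feature t is not linearly separable in t, but mapping t to the pair (sin t, cos t) makes one hyperplane enough.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
t = rng.uniform(0, 8 * np.pi, 500)       # raw periodic feature
y = (np.sin(t) > 0).astype(int)          # class alternates every half period

# In raw t the classes alternate over several periods, so a
# linear model cannot find a single threshold that works.
raw_acc = LogisticRegression().fit(t[:, None], y).score(t[:, None], y)

# After mapping t -> (sin t, cos t) the classes sit on opposite
# sides of the plane sin(t) = 0, so a linear model separates them.
X = np.column_stack([np.sin(t), np.cos(t)])
trig_acc = LogisticRegression().fit(X, y).score(X, y)
```

Note that this keeps the feature space tiny: one raw feature becomes two, and no higher-dimensional kernel space is needed.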

Another way: you can also solve a problem like this with polynomial kernel approaches, see: The Kernel Trick in Support Vector Classification | by Drew Wilimitis | Towards Data Science
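A small sketch of the kernel route on an assumed toy problem (a circular class boundary, which is a standard example, not the thread's actual data): a linear SVM struggles, while a degree-2 polynomial kernel handles it, because the implicit feature space contains the squared terms.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, (400, 2))
# Label is 1 inside a circle of radius sqrt(0.5): not linearly separable.
y = (X[:, 0] ** 2 + X[:, 1] ** 2 < 0.5).astype(int)

# A linear kernel cannot do much better than predicting the majority class.
linear_acc = SVC(kernel="linear").fit(X, y).score(X, y)

# A degree-2 polynomial kernel implicitly works with x1^2, x2^2, x1*x2,
# so the circular boundary becomes a hyperplane in that feature space.
poly_acc = SVC(kernel="poly", degree=2).fit(X, y).score(X, y)
```

The difference to the sin/cos transform above is that the kernel grows the (implicit) feature space instead of hand-crafting a compact one.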

When analysing the residuals, you do not want to see any systematic pattern. If the model did a bad job, you would see at least some pattern in the residuals instead of random “white noise”.
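A quick numeric sketch of that check, on made-up data (the sinusoidal trend and the correlation test are my assumptions, chosen just to illustrate the point): fit a straight line to data with a periodic component, and the residuals still carry that component; add the right feature, and the residuals lose the pattern.

```python
import numpy as np

rng = np.random.default_rng(2)
t = np.linspace(0, 4 * np.pi, 300)
y = 0.5 * t + np.sin(t) + rng.normal(0, 0.1, t.size)

# Under-specified model: straight line only.
coef_lin = np.polyfit(t, y, 1)
resid_lin = y - np.polyval(coef_lin, t)
# Residuals still correlate strongly with sin(t): a systematic pattern.
pattern = abs(np.corrcoef(resid_lin, np.sin(t))[0, 1])

# Better model: include the sin(t) feature explicitly.
A = np.column_stack([t, np.sin(t), np.ones_like(t)])
coef_full, *_ = np.linalg.lstsq(A, y, rcond=None)
resid_full = y - A @ coef_full
# Now the residuals are uncorrelated with the structure: white noise.
white = abs(np.corrcoef(resid_full, np.sin(t))[0, 1])
```

In practice you would plot the residuals rather than compute a single correlation, but the idea is the same: structure left in the residuals means the model is missing something.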

This can absolutely help in data understanding.
Why do you think it would be “better” to add new features?

I think the transformation, without growing the dimensional space, already makes the data linearly separable in a minimal-dimensional space. To me it seemed quite elegant this way, but many approaches can solve the issue.
In general: the most suitable approach depends on the data as well as on the business problem you are solving. Often, in practice, it is sufficient to find a solution that is just “good enough”.

Best regards
