C2_W4_Lab_02_Tree_Ensemble. Continuous valued features

Goran_Hrzenjak · February 12, 2023, 11:42am

Besides the 5 categorical features in the heart.csv dataset, there are several continuous valued features:

RestingBP
Cholesterol
RestingECG
MaxHR
Oldpeak


The categorical features have been taken care of with one-hot encoding - pd.get_dummies.
The continuous value features are used as is in the model and during the training.

Shouldn’t the continuous valued features be split on a threshold, as described in the lecture "Continuous valued features"? I would guess that, for example, the cholesterol level has some impact on heart disease.

Am I missing something, or seeing something wrong?

Thanks

saba_odisharia · February 12, 2023, 1:04pm

If I am correct, the algorithm takes care of the continuous valued features itself when it chooses the splits during training. At each node it considers both categorical and continuous features, as described in the lectures; for a continuous feature it tries candidate thresholds and keeps the one with the highest information gain. Decision trees work with both types of data. You one-hot encode the categorical features only because most implementations of the model don't accept non-numerical data as inputs.
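To make that concrete, here is a minimal sketch of the threshold search from the lecture, written in plain Python. It is not the code the lab's library runs internally; real implementations differ in detail. The function names `entropy` and `best_threshold` and the cholesterol numbers are made up for illustration.

```python
import math

def entropy(labels):
    """Entropy (in bits) of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def best_threshold(values, labels):
    """Return (threshold, information_gain) for the best split `value <= threshold`.

    Candidate thresholds are the midpoints between consecutive sorted values.
    """
    pairs = sorted(zip(values, labels))
    xs = [v for v, _ in pairs]
    ys = [y for _, y in pairs]
    parent = entropy(ys)
    best = (None, 0.0)
    for i in range(1, len(xs)):
        if xs[i] == xs[i - 1]:
            continue  # equal values give no usable midpoint
        threshold = (xs[i] + xs[i - 1]) / 2
        left, right = ys[:i], ys[i:]
        w_left = len(left) / len(ys)
        gain = parent - (w_left * entropy(left) + (1 - w_left) * entropy(right))
        if gain > best[1]:
            best = (threshold, gain)
    return best

# Hypothetical cholesterol readings and heart-disease labels, NOT taken from heart.csv.
cholesterol = [180, 195, 210, 240, 260, 285]
has_disease = [0, 0, 0, 1, 1, 1]
threshold, gain = best_threshold(cholesterol, has_disease)
print(threshold, gain)
```

On this toy data the search settles on the midpoint between 210 and 240, which separates the two classes perfectly. The point is that the raw numeric column goes in unchanged and the threshold is found during training, so no manual binning is needed beforehand.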

Goran_Hrzenjak · February 12, 2023, 2:33pm

Thanks! I was on the wrong path.
Goran
