Hi,
Why don't we explore the entire list of features and stop only when there are none left to make an additional split?
We only have to explore each feature once, so why not stop when there are none left?
Thanks,
Laurent.
Hey @ljb1706,
Exploring the entire list of features and stopping only when there are none left to make an additional split is a valid approach, but it's not always the most efficient or effective one. Here are some reasons why:
Overfitting: Decision trees can grow into complex, deep trees that perfectly fit the training data, but this often leads to poor generalization to unseen data (overfitting). Letting the tree grow until there are no features left can produce excessively deep trees that capture noise in the data and perform poorly on new data.
Computational Efficiency: Exploring all features and growing the tree until none are left can be computationally expensive, especially when there are many features or a large dataset. By setting stopping criteria, you can build smaller, more efficient trees.
Interpretability: Deep decision trees can become very complex and difficult to interpret, which is a problem when you want to understand the model's decision-making process. The sketch below shows how such stopping criteria are typically set in practice.
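To make this concrete, here is a minimal sketch using scikit-learn (an illustration, not the course's own code); `max_depth`, `min_samples_split`, and `min_impurity_decrease` are examples of the stopping criteria being discussed, and the dataset is synthetic:

```python
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

# A small synthetic dataset just for illustration
X, y = make_classification(n_samples=500, n_features=20, random_state=0)

# Without stopping criteria: the tree grows until every leaf is pure
# (or no split improves impurity), which tends to overfit.
unrestricted = DecisionTreeClassifier(random_state=0).fit(X, y)

# With stopping criteria: growth halts early, giving a smaller,
# more interpretable tree that usually generalizes better.
restricted = DecisionTreeClassifier(
    max_depth=4,                 # stop once the tree is 4 levels deep
    min_samples_split=20,        # don't split nodes with fewer than 20 samples
    min_impurity_decrease=0.01,  # only split if impurity drops enough
    random_state=0,
).fit(X, y)

print("Unrestricted depth:", unrestricted.get_depth())
print("Restricted depth:  ", restricted.get_depth())
```

In practice these thresholds are usually tuned with cross-validation rather than fixed by hand.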
I hope it's clearer now why using stopping criteria in decision trees is important.
Regards,
Jamal
Using an exhaustive search can be problematic if there are lots of features and you need to do this calculation very often.
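As one hedged illustration of that cost, scikit-learn's trees expose a `max_features` parameter so each split considers only a random subset of candidate features instead of all of them (the timing numbers here are just whatever your machine produces, not results from the course):

```python
from time import perf_counter

from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=5000, n_features=200, random_state=0)

for max_features in (None, "sqrt"):
    # None  -> exhaustive search over all 200 features at every node
    # "sqrt" -> only ~14 randomly chosen candidate features per split
    tree = DecisionTreeClassifier(max_features=max_features, random_state=0)
    start = perf_counter()
    tree.fit(X, y)
    print(f"max_features={max_features!r}: fit in {perf_counter() - start:.2f}s")
```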
Thanks for the detailed answer.