Deciding when to use a Decision Tree Model

Luke_Rogers · October 5, 2022, 4:47am

Hi! I’ve been reading some further material around Tree based models after the first 2 videos: When and Why Tree-Based Models (Often) Outperform Neural Networks | by Andre Ye | Towards Data Science

Curious to understand
a) Is there a general rule of thumb as to when one should use a decision tree vs. another model e.g. Neural Network?
b) Would it be advised to implement both models and compare training sets?
c) If yes - then is there any value in amalgamating the two models e.g. A Decision Tree that calls on a Neural network to classify?

It would be great to have some further examples of ML use cases and the types of models chosen and why.

Cheers,
Luke.

TMosh · October 5, 2022, 6:43am

a) No.
b) Yes.
c) It depends on how you define “value”. Try it and see.

Dalila · October 8, 2022, 5:25pm

Hi @Luke_Rogers ,
I don’t think there is a rule of thumb that you can always apply.

It depends on the type and volume of data and also on the business problem.

When it comes to types of data :
You should use decision trees with tabular/ structured data and Neural Networks with unstructured data (images, videos, audio etc)
When it comes to volume : You should use Neural Networks when you have a large amount of data
When it comes to type of business problems :
If interpretability is more important than performance you should use Decision Trees on the contrary if performance is more important than interpretability Neural Networks might be better

Hope this helps
Dalila

rmwkwok · October 9, 2022, 2:01am

Hi @Luke_Rogers,

I agree that there is no general rule of thumb for that, especially when you can combine two models - no matter how you combine them. A rule of thumb is to evaluate your work by a cv set to figure out what is the best model, or what is the best way to combine the models.

The above is my answers and I am sorry that they may not be what you are asking for. I have never seen a Standard Operation Procedure for that either. However, when a tabulated dataset comes to me, I would try Gradient boosted decision trees first; whereas when a image dataset comes to me, I would find a pre-trained image network first. Then I can start my investigation and improvement cycles.

if you have done some work and have some findings to share and discuss, you are welcomed to post them.

Cheers,
Raymond

Luke_Rogers · October 9, 2022, 3:26am

Thanks All - I really appreciate the feedback.
@rmwkwok As you say I’m essentially using trial and error approach to find optimal models and find that especially while I’m in my infancy of ML this is useful since my assumptions on what would be best aren’t always correct.

Noted the course material did actually go on to explain when to use decision trees (primarily structured data) I was just curious (& impatient).

Cheers,
L.

rmwkwok · October 9, 2022, 11:53am

You are welcome @Luke_Rogers I appreciate your understanding too.

I think the best part about ML is when we really do something, and it is also the moment we can discuss something further, and more meaningful. I look forward to such discussion in the future

Raymond

Topic		Replies	Views
Decision Trees vs Neural Networks Advanced Learning Algorithms week-4	7	264	June 24, 2024
Decision Trees for Regression? Advanced Learning Algorithms week-4	7	619	September 23, 2023
When to use decision tree? Advanced Learning Algorithms week-4	5	522	February 3, 2023
When Trees Outdo Neural Networks: Decision Trees Perform Best on Most Tabular Data AI Discussions the-batch , ai-discussions	1	106	May 20, 2023
When to use regression vs neural network model Advanced Learning Algorithms week-3	4	386	August 14, 2023

Deciding when to use a Decision Tree Model

Related topics