Explanation of the formula for Information Gain in the decision nodes

Alexandros_Paipetis · January 10, 2024, 7:59am

Hi there,

I would like to ask the following.

When computing the reduction in entropy (the information gain) to choose a split, I understand that for the root node the formula is:

1 - Weighted Average Entropy

We subtract it from 1 because the entropy at the root node is originally 1.

My question is: what would the formula be when splitting the other decision nodes of the tree?

I understand how to compute the weighted average entropy, but I am not sure whether it should be subtracted from 1 as we did for the root node, and if so, why. I.e., is the entropy at the parent node always 1?

Thanks.

TMosh · January 10, 2024, 6:24pm

Perhaps your question is covered in the “Choosing a split: Information Gain” lecture? Or perhaps in the “Putting it together” lecture.

I believe the splitting process is identical at each node.
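
To make "identical at each node" concrete, here is a minimal Python sketch. It is not taken from the course labs; the function names `entropy` and `information_gain` and the toy labels are illustrative assumptions. The general form is: information gain = H(node being split) - weighted average entropy of its children, where H is the binary entropy. The leading term equals 1 only when the node being split is a 50/50 mix, which is the situation the question describes for the root. At other nodes it is generally smaller than 1, so you subtract from the entropy of that node rather than from 1.

```python
import math


def entropy(p1):
    """Binary entropy in bits for a node whose fraction of positive labels is p1."""
    if p1 == 0 or p1 == 1:
        return 0.0  # a pure node has zero entropy
    return -p1 * math.log2(p1) - (1 - p1) * math.log2(1 - p1)


def information_gain(parent, left, right):
    """Entropy of the node being split minus the weighted entropy of its children.

    parent, left, right are lists of 0/1 labels; left and right partition parent.
    """
    def frac_pos(labels):
        return sum(labels) / len(labels) if labels else 0.0

    w_left = len(left) / len(parent)
    w_right = len(right) / len(parent)
    weighted = w_left * entropy(frac_pos(left)) + w_right * entropy(frac_pos(right))
    return entropy(frac_pos(parent)) - weighted


# Root node (toy data): 5 positives and 5 negatives, so its entropy is exactly 1.
root = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
root_left = [1, 1, 1, 1, 0]
root_right = [1, 0, 0, 0, 0]
gain_root = information_gain(root, root_left, root_right)

# Non-root node: the left child above (4 positives, 1 negative).
# Its entropy is about 0.72, not 1, and that is the value we subtract from.
node = root_left
gain_node = information_gain(node, [1, 1, 1], [1, 0])

print(f"root gain: {gain_root:.4f}")
print(f"inner-node gain: {gain_node:.4f}")
```

Using "1 - weighted average entropy" at the inner node here would overstate the gain, because that node's own entropy is already below 1.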
