Is it possible that entropy could increase from father node to children nodes in decision tree algorithm

Steve_Cai · September 14, 2022, 5:18am

Hi,

I am just finished the session of “decision tree learning” about purity-based entropy function and weight-based entropy reduction process. I wonder if it is possible that entropy could increase from father node to children nodes.

Specifically for the cats and dog problem, let’s assume the variables as follows,

Father node(root node): C cats, D dogs, sum S as C+D, entropy as Hf,
Left Child node: C1 cats, D1 dogs, sum S1 as C1+D1,entropy as Hl,
Right Child node: C2(=C-C1) cats, D2(D-D1) dog, sum is S-S1, entropy as Hr.
The weighted entropy of children as Hc.

I am curious that if there is a combo (C,D,C1,D1) that Hf<Hc (I wonder if there is some theorem in Math to disproof this). Many thanks.

Best,
Steve

rmwkwok · September 14, 2022, 10:52am

Hi Steve,

This course uses entropy as the measure of impurity for splitting decision, such that we split a node if entropy decreases. In order to know whether the entropy will decrease or not, we calculate the entropy before splitting, and the sum of weighted entropies after applying a candidate split. Again, since the decision is based on reducing entropy, the sum of weighted entropy in the child nodes is always smaller than the entropy in the parent node.

Let’s say the entropy of the parent node be H_p, entropy of the left and right child nodes be H_l and H_r respectively, we have the weighted sum entropy for the child nodes as H_c = w_lH_l + w_rH_r under the condition that H_c < H_p.

H_c < H_p is the condition for a split.

Cheers,
Raymond

Steve_Cai · September 15, 2022, 2:13am

Hi Raymond,

Thanks for your reply. I know that

Whereas I am curious that mathematically if there exists a split which can achieve Hp <Hc. Many Thanks.

Steve

rmwkwok · September 15, 2022, 2:29am

If the splitting condition is to require H_c < H_p, then no, otherwise it is possible. If you are interested in other splitting conditions, please feel free to share them.

Topic		Replies	Views
Why do we need to take weightage comparision in entropy function? Advanced Learning Algorithms week-module-4	7	499	February 6, 2023
Explanation of the formula for Information Gain in the decision nodes Advanced Learning Algorithms week-module-4	1	289	January 10, 2024
Choosing a split via entropy calculations Advanced Learning Algorithms week-module-4	1	397	June 30, 2023
Decision/Regression Tree is the Most "Balance" Tree when Reduction in Entropy/Variance has the Highest Value Advanced Learning Algorithms week-module-4	2	491	September 1, 2022
Negative Value of Information Gain Advanced Learning Algorithms week-module-4	2	487	June 5, 2023

Is it possible that entropy could increase from father node to children nodes in decision tree algorithm

Related topics