Compute information gain question

jleanezv · June 30, 2022, 10:59pm

Hello, I was completing the compute_information_gain function in the Decision tree lab, and I was using the y values instead of the y_node values to compute the entropy, getting the same output as the expected one. However, the test was not passing until I checked and said to use y_node. Why do we need to use y_node instead of just y?

rmwkwok · July 1, 2022, 4:36am

Hi @jleanezv,

I think you are talking about the practice lab exericse 3. In it, y is the label, and y_node = y[node_indices] which is a subset of the y. Therefore, when node_indices does not contain all indices of y, then y_node and y become different.

Since they can be different, we need to use the right one. And y_node is the right one, because it contains only labels that is concerned as defined by node_indices.

Cheers!
Raymond

Topic		Replies	Views
C2_W4_Decision_Tree_with_Markdown - why do we need X_node Advanced Learning Algorithms week-4	7	559	April 24, 2023
C2 W4 Decision Tree with Markdown - Information gain Advanced Learning Algorithms week-4	9	370	November 30, 2023
Information gain calculation problem Advanced Learning Algorithms week-4	5	427	October 24, 2023
CW2_W4_Decision_Tree_with_Markdown information_gain fails? Advanced Learning Algorithms week-4	2	23	August 27, 2024
Compute_information_gain_test_error Advanced Learning Algorithms week-4	1	449	August 15, 2023

Compute information gain question

Related topics