Hi, here is how you can debug it yourself.
- Click “File” > “Open” > “public_tests.py” and go to the function compute_information_gain_test. There you will find the following test, which expects an information gain of 0.311278:
```python
node_indexes = list(range(4))
result = target(X, y, node_indexes, 0)
assert np.isclose(result, 0.311278, atol=1e-6), f"Wrong information gain. Expected {0.311278} got: {result}"
```
- There you can see that the test data involves only 5 samples, of which node_indexes selects the first 4:
```python
X = np.array([[1, 0],
              [1, 0],
              [1, 0],
              [0, 0],
              [0, 1]])
y = np.array([[0, 1, 0, 1, 0]]).T
```
- Review the code provided in the exercise:
```python
# Split dataset
left_indices, right_indices = split_dataset(X, node_indices, feature)

# Some useful variables
X_node, y_node = X[node_indices], y[node_indices]
X_left, y_left = X[left_indices], y[left_indices]
X_right, y_right = X[right_indices], y[right_indices]

# You need to return the following variables correctly
information_gain = 0
```
- Open a new code cell, copy in the test data and the provided code, and run the provided code against the test data.
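If you are unsure what the split produces here, it can be reproduced by hand. This is only a sketch: it assumes the lab's split_dataset convention that samples whose feature value is 1 go to the left branch and value 0 to the right, so check it against your own split_dataset.

```python
import numpy as np

X = np.array([[1, 0],
              [1, 0],
              [1, 0],
              [0, 0],
              [0, 1]])
node_indices = list(range(4))
feature = 0

# Assumption: split_dataset sends feature value 1 to the left
# branch and feature value 0 to the right branch.
left_indices = [i for i in node_indices if X[i, feature] == 1]
right_indices = [i for i in node_indices if X[i, feature] == 0]

print(left_indices, right_indices)  # [0, 1, 2] [3]
```

So the node holds samples 0–3, the left branch samples 0–2, and the right branch sample 3 only.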
- As the exercise description explains, you need 3 entropy values (node, left branch, right branch) to calculate the information gain. Calculate them using your compute_entropy function. If you are not sure your compute_entropy is correct, verify it by recalculating the values by hand with the formula under section 4.1.
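For reference, here is a minimal standalone entropy helper using the standard binary entropy formula (it is not necessarily identical to your compute_entropy), applied to the node and the two branches of this test case, assuming the first four samples are split on feature 0 with value 1 going left:

```python
import numpy as np

def entropy(y):
    # Binary entropy: H(p) = -p*log2(p) - (1-p)*log2(1-p),
    # defined as 0 when p is 0 or 1.
    p = np.mean(y)
    if p == 0 or p == 1:
        return 0.0
    return -p * np.log2(p) - (1 - p) * np.log2(1 - p)

H_node = entropy(np.array([0, 1, 0, 1]))   # p = 1/2 -> 1.0
H_left = entropy(np.array([0, 1, 0]))      # p = 1/3 -> ~0.9183
H_right = entropy(np.array([1]))           # p = 1   -> 0.0
```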
- With those entropy values, calculate the information gain twice: once with the code you wrote in the exercise, and once by hand with the equation under section 4.3. Both should equal the expected value of 0.311278. Since your tests are failing, the two results will likely differ, and the discrepancy should give you hints on where the bug lies. Once you successfully debug it, copy the working code back to your exercise function and try the tests again.
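Putting it together, a hand calculation of the information gain for this test case (again assuming the value-1-goes-left split of the first four samples) reproduces the expected value:

```python
import numpy as np

def entropy(y):
    # Binary entropy, defined as 0 when all labels are equal
    p = np.mean(y)
    if p == 0 or p == 1:
        return 0.0
    return -p * np.log2(p) - (1 - p) * np.log2(1 - p)

y_node = np.array([0, 1, 0, 1])   # first 4 samples
y_left = np.array([0, 1, 0])      # samples with feature 0 == 1
y_right = np.array([1])           # samples with feature 0 == 0

# Information gain = H(node) - weighted average of branch entropies
w_left = len(y_left) / len(y_node)
w_right = len(y_right) / len(y_node)
info_gain = entropy(y_node) - (w_left * entropy(y_left) + w_right * entropy(y_right))

print(round(info_gain, 6))  # 0.311278
```

If your exercise code gives a different number, compare it against each intermediate value here (the two weights and the three entropies) to narrow down which step is wrong.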
- Once you pass all the tests, remove the code cell you created for this debugging work.
Good luck.
Raymond