Requesting help for C2_W4 Practice Lab Exercise 3

Hello,

I’ve been getting a weird result from the 3rd exercise. I get the error below despite my cell producing the correct output.

---------------------------------------------------------------------------
AssertionError                            Traceback (most recent call last)
<ipython-input-42-2a50df79e115> in <module>
      9 
     10 # UNIT TESTS
---> 11 compute_information_gain_test(compute_information_gain)

~/work/public_tests.py in compute_information_gain_test(target)
    104 
    105     result = target(X, y, node_indexes, 1)
--> 106     assert np.isclose(result, 0, atol=1e-6), f"Wrong information gain. Expected {0.0} got: {result}"
    107 
    108     print("\033[92m All tests passed.")

AssertionError: Wrong information gain. Expected 0.0 got: nan

I tried using the hint code, but the exact same error appeared. Then I copied the code snippet from public_tests.py to try to reproduce the error:

import numpy as np

X = np.array([[1, 0],
              [1, 0],
              [1, 0],
              [0, 0],
              [0, 1]])

y = np.array([[0, 0, 0, 0, 0]]).T
node_indexes = list(range(5))
result = compute_information_gain(X, y, node_indexes, 1)
print(result)
assert np.isclose(result, 0, atol=1e-6), f"Wrong information gain. Expected {0.0} got: {result}"

But this code snippet passed and didn’t raise the assertion error, so I’m not sure what’s actually going on. Can someone help me diagnose the issue?
Thank you very much for your help.

Yours sincerely,
LUIY0004

Hi @luiy0004

Please make sure you follow the steps of the implementation; you can also follow the hint steps provided in the notebook.

Thanks!
Abdelrahman

Hey @luiy0004,
This kind of behaviour is mostly seen when your implementation uses one or more global variables, which you are not supposed to use, unless stated otherwise. Please ensure that you haven’t done the same. Let us know if this helps.
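For illustration, here is a minimal sketch (not from the lab itself) of how an accidental global reference causes exactly this pass-in-your-cell/fail-in-the-unit-test split; the names X_train and fraction_positive are made up for the example:

import numpy as np

# Global notebook variable, e.g. data defined in an earlier cell
X_train = np.array([1.0, 2.0, 3.0])

def fraction_positive(X):
    # BUG: uses the global X_train instead of the parameter X, so the
    # result silently depends on notebook state rather than the input
    return len(X_train[X_train > 0]) / len(X_train)

print(fraction_positive(X_train))                 # 1.0 — looks correct
print(fraction_positive(np.array([-1.0, -2.0])))  # still 1.0 — wrong

A unit test passes different data as the argument, so any hidden dependence on notebook state shows up only there.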

Cheers,
Elemento


Dear Abdelrahman,

I tried following the hints and got the exact same error.

LUIY0004

Dear Elemento,

I checked the hint variables against the other sections. The hint variables do not seem to be global variables. This is the hint code that produced the error:

# UNQ_C3
# GRADED FUNCTION: compute_information_gain

def compute_information_gain(X, y, node_indices, feature):
    
    """
    Compute the information of splitting the node on a given feature
    
    Args:
        X (ndarray):            Data matrix of shape(n_samples, n_features)
        y (array like):         list or ndarray with n_samples containing the target variable
        node_indices (ndarray): List containing the active indices. I.e, the samples being considered in this step.
   
    Returns:
        cost (float):        Cost computed
    
    """    
    # Split dataset
    left_indices, right_indices = split_dataset(X, node_indices, feature)
    
    # Some useful variables
    X_node, y_node = X[node_indices], y[node_indices]
    X_left, y_left = X[left_indices], y[left_indices]
    X_right, y_right = X[right_indices], y[right_indices]
    
    # You need to return the following variables correctly
    information_gain = 0
    
    ### START CODE HERE ###
    # Your code here to compute the entropy at the node using compute_entropy()
    node_entropy = compute_entropy(y_node)
    # Your code here to compute the entropy at the left branch
    left_entropy = compute_entropy(y_left)
    # Your code here to compute the entropy at the right branch
    right_entropy = compute_entropy(y_right)

    # Your code here to compute the proportion of examples at the left branch
    w_left = len(X_left) / len(X_node)

    # Your code here to compute the proportion of examples at the right branch
    w_right = len(X_right) / len(X_node)

    # Your code here to compute weighted entropy from the split using 
    # w_left, w_right, left_entropy and right_entropy
    weighted_entropy = w_left * left_entropy + w_right * right_entropy

    # Your code here to compute the information gain as the entropy at the node
    # minus the weighted entropy
    information_gain = node_entropy - weighted_entropy
    ### END CODE HERE ###  
    
    return information_gain

Is there something wrong with how I followed the hint code?

LUIY0004

Hi @luiy0004

If we look at the failed test, we can see that the function is producing a nan.
What is nan, and how is it produced?

One way is division by 0.
Once a nan has been produced, any arithmetic with it yields nan.

I would check everywhere division is performed in compute_information_gain() and compute_entropy() for places where division by 0 can occur, and try to prevent it.
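For example, a minimal sketch of compute_entropy with those divisions guarded might look like the following. Treat this as an assumption about the lab’s binary-classification setup, not the official solution:

import numpy as np

def compute_entropy(y):
    # Guard: an empty node (all samples went to the other branch)
    # would otherwise cause a division by zero here
    if len(y) == 0:
        return 0.0

    # Fraction of positive examples at this node
    p1 = np.sum(y == 1) / len(y)

    # Guard: a pure node would otherwise evaluate 0 * log2(0),
    # which is nan
    if p1 == 0 or p1 == 1:
        return 0.0

    return -p1 * np.log2(p1) - (1 - p1) * np.log2(1 - p1)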


Dear SamReiswig,

Checking for where the NaN could have been produced worked. Division by 0 was occurring in the compute_entropy function. Strangely, that function passed all its tests before I corrected it.
Thank you very much for your help.

Yours sincerely,
LUIY0004

I have the same issue as luiy0004. I went to check compute_entropy(y); the only ‘/’ happens at p1 = len(y[y==1])/len(y). I added a condition on top: if len(y) > 0, but it still doesn’t work. Please help, thanks!

Hi @Hui_Chen2 ,

I can’t tell what’s wrong from the description.
Can you send me your code in a direct message and I’ll take a look.

I am facing the same issue. Was it resolved?

Hi @SiddharthPandey ,

Can you send me your code in a direct message and I’ll take a look.

There’s another thread with the same issue. I fixed it with the hint (covering all the edge cases in the previous function).

For anyone finding themselves here.
This happens when all the samples end up in one branch and none in the other, IF you have not accounted for there being no items in y in compute_entropy().

The unit tests for compute_entropy() should really test for that but, as of the time of writing, they don’t.
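For example, here is a quick edge-case check you can run in the notebook (it assumes compute_information_gain and split_dataset are already defined there; the data below is made up so that every sample lands in the left branch):

import numpy as np

# Every sample has feature 0 == 1, so splitting on feature 0 sends
# all samples to the left branch and leaves the right branch empty
X = np.array([[1, 0],
              [1, 1],
              [1, 0]])
y = np.array([0, 1, 0])
node_indices = list(range(3))

# With an unguarded compute_entropy this produces nan (or an error);
# with the empty-branch case handled it returns a plain 0
gain = compute_information_gain(X, y, node_indices, 0)
assert not np.isnan(gain), "information gain should never be nan"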


Thank you! Your answer helped me a lot. I think this is exactly what is going on.

Hey @SandunMeesara,
Welcome, and we are glad that you could become a part of our community :partying_face: Thanks a lot for letting us know that your issue has been resolved.

Cheers,
Elemento