C2_W4_Decision_Tree_with_Markdown split_dataset error

Martin_Wessman · October 4, 2023, 12:18pm

Hi,

I’m getting the split to work, and it gives the “Expected output”.
But my function fails on the assert length of left and right splits:

AssertionError Traceback (most recent call last)
in
31
32 # UNIT TESTS
—> 33 split_dataset_test(split_dataset)

~/work/public_tests.py in split_dataset_test(target)
44 assert type(right[0]) == int, f"Wrong type for elements in the right list. Expected: number got: {type(right[0])}"
45
—> 46 assert len(left) == 2, f"left must have 2 elements but got: {len(left)}"
47 assert len(right) == 3, f"right must have 3 elements but got: {len(right)}"
48

AssertionError: left must have 2 elements but got: 3

Help would be appreciated
Best regards
Martin

TMosh · October 4, 2023, 6:05pm

The Hint code for this function gives you most of the implementation.

The only bit you have to implement is an if-statement that does the part I’ve indicated with an arrow:
“check if the value of X at index [i] and [feature] == 1”.

Martin_Wessman · October 5, 2023, 11:31am

Thanks for quick reply.
I saw the hint and the split works, see below figure. The actual splits are equal to “Expected Output” - red boxes in the below fig.

But the unit tests fail at row 46 where the unit test is “assert len(left) == 2”. But from the “Expected output” it seems that len(left) is 3…

Best regards
Martin

TMosh · October 5, 2023, 9:21pm

Sorry, I’m a little bit stuck on investigating this issue, because I’ve temporarily lost access to the MLS course notebooks.

Hopefully DLAI can fix it soon.

TMosh · October 5, 2023, 11:45pm

What’s happening here is confusion about how the tests are numbered.

There are two cases in the notebook, they’re labeled Case 1 and Case 2. Your code passes both of those. Those tests are run first.

Then the split_dataset_test() function has three other test cases. They’re numbered 1, 2, and 3.
Your code is failing its Case 1 test.
Here are the values for that test, maybe you can work this out by hand and see where your code malfunctions. The test case code is in the public_tests.py file.

TMosh · October 5, 2023, 11:51pm

I think this is the test that fails (printing X, node_indices, and feature):

Case 1 there has five examples, it’s testing feature 2 (the last column), and there are only two examples that have a 1 in that position.

Martin_Wessman · October 6, 2023, 11:22am

Fixed,
It was a silly bug from my side of course. I used the global variable X_train in the function instead of the local variable X, which worked for many cases, but not for the test 3 test data.
Thanks a lot for the support!
/Martin

TMosh · October 6, 2023, 2:53pm

Thanks for your report!

Topic		Replies	Views
[URGENT] Unable to progress in Week 4 due to assertion error in Exercise 2 of Practice Lab: Decision Trees Advanced Learning Algorithms week-4	6	299	February 16, 2024
C2 W4 Decision tree with markdown "rigth" instead of "right" error in public_tests.py? Advanced Learning Algorithms week-4	2	514	September 6, 2022
Unit test case failure in C2_W4_Decision_Tree_with_Markdown Advanced Learning Algorithms week-4	6	86	October 7, 2024
Decision Tree lab - split_dataset unit test error Advanced Learning Algorithms week-4	5	498	February 15, 2023
Error in get_best_split tests in C2_W4_Decision_Tree_with_Markdown Advanced Learning Algorithms week-4	16	812	June 5, 2023

C2_W4_Decision_Tree_with_Markdown split_dataset error

Related topics