NLP C3W2 Assignment - Error in unit tests for data_generator function

map9 · March 6, 2022, 8:15pm

Having obtained the “expected output” for this function, I get the following error in the unit tests:

[Screenshot: data_generator error]

It seems to me the error is in the unit test code, where it is apparently requesting the shape of a list object and not of an array.

balaji.ambresh · March 7, 2022, 6:53am

You are supposed to return a tuple of 3 numpy arrays.

See this comment in the function data_generator
# convert the batch (data type list) to a numpy array
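
A minimal sketch of that conversion step, assuming the padded examples and their masks were collected as plain Python lists. The variable names and values here are illustrative and are not taken from the assignment, which may use a numpy-compatible module rather than plain numpy:

```python
import numpy as np

# Hypothetical padded batch accumulated as plain Python lists (0 is the pad id).
tensor_pad_list = [[49, 50, 51, 1, 0], [50, 51, 1, 0, 0]]
mask_list = [[1, 1, 1, 1, 0], [1, 1, 1, 0, 0]]

# Convert each list (data type list) to a numpy array before returning it.
batch_np_arr = np.array(tensor_pad_list)
mask_np_arr = np.array(mask_list)

# A tuple of 3 numpy arrays: inputs, targets, mask.
# Assumption: inputs and targets are the same array, as in a language-model setup.
result = (batch_np_arr, batch_np_arr, mask_np_arr)
```

Each element then has a .shape attribute, which is what a shape-checking unit test would read.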

map9 · March 7, 2022, 8:23am

…which is what I’m doing, no?

balaji.ambresh · March 7, 2022, 8:54am

Please click my name and message your notebook as an attachment.

map9 · March 7, 2022, 10:37am

I will do that. Many thanks.

Not sure how to attach my notebook to the message…

balaji.ambresh · March 7, 2022, 11:05am

Looked at the notebook. There was an extra -1 term when computing the number of elements to pad inside the function data_generator.
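
For illustration only, here is a sketch of the pad-length computation without the extra term. The helper name pad_to_length is hypothetical, and it assumes the token list already includes any end-of-sentence id:

```python
def pad_to_length(tokens, max_len, pad_id=0):
    """Right-pad a list of token ids with pad_id until it has max_len entries."""
    # Number of padding elements needed; note there is no extra "- 1" here.
    n_pad = max_len - len(tokens)
    return tokens + [pad_id] * n_pad

padded = pad_to_length([49, 50, 51, 1], max_len=6)
```

With an extra -1 the padded example would come out one element short, so the rows of the batch would no longer share the same length.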

balaji.ambresh · March 7, 2022, 11:32am

@Community-Team @Mubsi @paulinpaloalto
There are bugs in w2_unittest.py inside test_data_generator.
failed_cases is a list. So, instead of invoking it like a function, use failed_cases.append.
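
A small sketch of the difference. The dictionary keys below are made up for illustration and are not copied from w2_unittest.py:

```python
failed_cases = []

try:
    # Calling a list as if it were a function raises a TypeError.
    failed_cases({"name": "shape_check"})
except TypeError as err:
    error_message = str(err)

# Appending is the way to record a failed case in a list.
failed_cases.append({"name": "shape_check", "expected": (2, 10), "got": (2,)})
```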

map9 · March 7, 2022, 12:53pm

Got it.

Many thanks for your help!

Mubsi · March 7, 2022, 6:10pm

Thanks @balaji.ambresh, noted.

drew_Frances · May 6, 2022, 4:18pm

@Mubsi @balaji.ambresh I am also having problems in the unit_tests: I get 24/6 tests passed. Initially I get:

(DeviceArray([[49, 50, 51, 52, 53, 54, 55, 56, 57, 1],
[50, 51, 52, 53, 54, 55, 56, 57, 48, 1]], dtype=int32),
DeviceArray([[49, 50, 51, 52, 53, 54, 55, 56, 57, 1],
[50, 51, 52, 53, 54, 55, 56, 57, 48, 1]], dtype=int32),
DeviceArray([1, 1], dtype=int32))

Expected output

(DeviceArray([[49, 50, 51, 52, 53, 54, 55, 56, 57, 1],
[50, 51, 52, 53, 54, 55, 56, 57, 48, 1]], dtype=int32),
DeviceArray([[49, 50, 51, 52, 53, 54, 55, 56, 57, 1],
[50, 51, 52, 53, 54, 55, 56, 57, 48, 1]], dtype=int32),
DeviceArray([[1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
[1, 1, 1, 1, 1, 1, 1, 1, 1, 1]], dtype=int32))

Element with index 2 in the output tuple has incorrect shape. It should be (batch_size, max_length).
Expected (2, 10).
Got (2,).

I am assuming element with index 2 is the mask_np_arr?
I use np.where() to create the example_mask

My lab id is phpssgbp

Thanks,
Drew

balaji.ambresh · May 6, 2022, 4:28pm

@drew_Frances Please click my name and message your notebook as an attachment.

balaji.ambresh · May 6, 2022, 7:33pm

@drew_Frances Computation of example_mask is incorrect.
Please be aware of the parameter type when using np.where. Your implementation would be correct if tensor_pad were a numpy array. Unfortunately, it's a list. As a result, instead of being vectorized, the comparison is a single Python comparison between the whole list and 0. Since a list is never equal to 0, the condition is simply True and you get a single 1 as the result.
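
A short sketch of the behaviour described above, using plain numpy and a made-up padded example (the values are illustrative):

```python
import numpy as np

tensor_pad = [49, 50, 51, 1, 0, 0]  # plain Python list, 0 is the pad id

# list != 0 is one Python bool (True), so np.where returns a 0-d array holding 1.
wrong_mask = np.where(tensor_pad != 0, 1, 0)

# Converting to an array first makes the comparison element-wise.
right_mask = np.where(np.array(tensor_pad) != 0, 1, 0)
```

Stacking one such scalar per example gives a mask of shape (batch_size,), which matches the (2,) reported in the failing test.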

drew_Frances · May 6, 2022, 9:18pm

Hi Balaji.ambresh:

Thanks for the answer. That fixed it. This explains why my test code works: I was using np.array.

I seem to be making this mistake a lot. Is there a way that I would run the code locally and have some tool pick up these type errors? Does numpy use type hints?

Again, thanks for the help!

Cheers,
Drew

balaji.ambresh · May 7, 2022, 6:54am

Try these steps to run a notebook locally:

1. Download the assignment by clicking Lab Files and then downloading all files.
2. Install the correct versions of the libraries. Run !pip list > libs.txt in the Coursera Jupyter environment to get this information.

See numpy.typing for numpy type hints.
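
As a rough sketch of what such hints can look like, assuming numpy 1.21 or newer (where numpy.typing.NDArray is available). The function make_mask is hypothetical. Annotations are not enforced at runtime, so a static checker such as mypy has to be run separately, and whether it flags a list-versus-scalar comparison depends on its configuration. The np.asarray call is a runtime guard that works either way:

```python
import numpy as np
import numpy.typing as npt

def make_mask(tokens: npt.ArrayLike, pad_id: int = 0) -> npt.NDArray[np.int_]:
    """Return 1 for real tokens and 0 for padding."""
    # Accept lists or arrays, but always compare on an array.
    arr = np.asarray(tokens)
    return np.where(arr != pad_id, 1, 0)

mask_from_list = make_mask([49, 50, 1, 0])
```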

drew_Frances · May 8, 2022, 4:30pm

@balaji.ambresh:

Thanks for the answer!

Cheers,
Drew
