Hi,
There are some inconsistencies in the notebook's 'Expected Output' sections.
After creating the train and validation datasets, the expected output is stated as
Images of train generator have shape: (None, 28, 28)
But it should have been:
Images of train generator have shape: (None, 28, 28, 1)
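For what it's worth, the trailing 1 is just the grayscale channel dimension. Here is a minimal sketch of where that shape comes from, assuming `tf.keras.utils.image_dataset_from_directory` and an illustrative directory path (not necessarily the assignment's exact helper):

```python
import tensorflow as tf

# Loading the 28x28 sign-language images as grayscale adds an explicit
# channel dimension, so each batch has shape (batch_size, 28, 28, 1),
# which the notebook reports as (None, 28, 28, 1) for an unspecified batch size.
train_ds = tf.keras.utils.image_dataset_from_directory(
    "data/train",               # hypothetical path, not the assignment's
    image_size=(28, 28),
    color_mode="grayscale",     # -> trailing channel dimension of 1
    label_mode="int",
    batch_size=32,
)

for images, labels in train_ds.take(1):
    print(images.shape)         # (32, 28, 28, 1)
```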
The dataset has 24 classes, as is also mentioned in the notebook. If you add an output layer with 24 units, it passes one unit test and fails another; if you change it to 26 units, it fails the first unit test and passes the second.
The only way to make both work is to use a particular loss function, but there is no hint about that anywhere; it is not mentioned at all. The whole story with 24 vs. 26 classes is really confusing, and it would be nice to clarify it somehow. I should add that fitting the model works just fine in either case (24 output units with one loss function, or 26 output units with the other) and you reach the same level of accuracy.
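For reference, here is a rough sketch of the two self-consistent setups I mean. The loss names and layer calls are standard Keras, but the helper calls and paths are only illustrative, not the assignment's starter code:

```python
import tensorflow as tf

# Setup 1: one-hot labels. The one-hot vectors have one entry per class
# folder, so the output layer must have exactly 24 units.
train_onehot = tf.keras.utils.image_dataset_from_directory(
    "data/train", image_size=(28, 28), color_mode="grayscale",
    label_mode="categorical")            # labels come out as one-hot vectors
model_24 = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(24, activation="softmax"),   # must equal #classes
])
model_24.compile(optimizer="adam", loss="categorical_crossentropy",
                 metrics=["accuracy"])

# Setup 2: integer labels. The sparse loss only indexes into the output
# vector, so 26 units also train fine even though only 24 classes exist.
train_int = tf.keras.utils.image_dataset_from_directory(
    "data/train", image_size=(28, 28), color_mode="grayscale",
    label_mode="int")                    # labels come out as plain integers
model_26 = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(26, activation="softmax"),
])
model_26.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
                 metrics=["accuracy"])
```

Both of these compile and fit without errors; the conflict is only with what the unit tests expect.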
Thanks for bringing this up. The staff have been informed to fix this assignment. Please set the number of neurons in the output layer to the number expected by the grader for now.
The American Sign Language letter database of hand gestures represents a multi-class problem with 24 classes of letters (excluding J and Z, which require motion).
The original dataset has a different label mapping since J is missing, whereas the dataset prepared for this assignment doesn't have any blank folders. Since this could lead to some confusion, please wait for the staff to address your concern.
(i) First I coded and ran the whole notebook treating it as a 24-class problem:
* it allowed me to compile, fit, and plot the train and validation accuracy,
* but it returned errors on the unit tests, which require a (None, 26) output.
(ii) Then I modified the create_model function for a 26-class problem:
* it didn’t help me get the grade; here is a screenshot.
After getting your grade, you see two items (as shown in your figure, too): train_val_datasets and create_model. Click on them to see why the grader was not happy with your solution.
As balaji suggested, use 26 as your output layer size. To pass the unit tests, see the ungraded lab that balaji mentioned. There you will find a clue about the loss function.
I hope you will find the solution and pass the test.
Thanks very much for your help
If it is allowed, I think explaining how and why that works would be beneficial (is it something close to broadcasting by Keras?), even though it may be a bit beyond the scope of this particular assignment.
Can you click on the two graded cells mentioned in your grader output, so it shows why you failed the grader? That provides information on why you didn’t clear it, so we can work out where your code must have gone wrong. @gtvid
Also, because the course was recently updated, and so we know which version you are on, can you share a screenshot of the graded cell code by personal DM? Please don’t post code here.
Actually the code works just fine with the other loss function and the model trains. But for some reason the unit tests expect this particular loss function.
The model trains just fine when the label mode is ‘categorical’ and the ‘cat…cross’ loss is used, but only if the number of output units in the model matches the actual number of classes.
However, the other setup, with label mode ‘int’ and the sparse loss function, works even with more units in the output layer than actual classes. I’m trying to figure out why.
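My best guess so far, as a toy sketch (not the assignment code): it isn’t really broadcasting. The sparse loss only uses the integer label to pick out one entry of the prediction vector, so output units that never appear as a label are simply never selected and just get trained toward zero probability:

```python
import tensorflow as tf

# A 26-way prediction for a single example; the entries sum to 1.
probs = tf.constant([[0.02] * 25 + [0.50]])

# An integer label in 0..23 (only 24 letters ever occur in the data).
label = tf.constant([3])

# sparse_categorical_crossentropy just computes -log(probs[0, label]),
# so extra output units are harmless.
sparse_loss = tf.keras.losses.sparse_categorical_crossentropy(label, probs)
print(float(sparse_loss[0]))            # -log(0.02) ≈ 3.91

# categorical_crossentropy instead needs a one-hot label whose length
# matches the output size exactly, which is why 24-element one-hot labels
# cannot be paired with a 26-unit output layer.
one_hot = tf.one_hot(label, depth=26)
dense_loss = tf.keras.losses.categorical_crossentropy(one_hot, probs)
print(float(dense_loss[0]))             # same value when the depths match
```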
Here is the screen where the grader failed. The main reason is that the grader’s test data generation is inconsistent with the notebook: each label in my dataset is a one-hot vector of 26 elements, while the grader generates an integer label in the range [0, 26] (see unittests.py line 186).
In fact, the grader still has several bugs. When I fixed this one, another test case appeared that requires the loss function to be ‘spar…cat’, while the loss function actually used to fit the model was ‘cat…’.
Another problem is the ambiguity between the data having only 24 classes and the grader’s next test case requiring an output shape of (None, 26) (see unittests.py line 202).
@Community-Team I have seen a similar learner experience elsewhere recently. It seems the notebook code, the in-notebook unit tests, and the grader are maybe not sufficiently regression-tested for consistency before code changes are released. Perhaps unified regression testing could surface these self-inflicted issues, as well as code broken by third-party Python/TensorFlow/Keras etc. package revisions that are not backwards compatible.
The issue has been resolved per the staff. You will get the updated lab when you launch it. Then you can copy your previous solution into the new one before submitting.