Week2 Emoji_v3a low test accuracy but passed the grader

My code for Emoji_v3a passes all of the internal tests and shows high accuracy and low loss while training the Emojify_V2 model, but the test accuracy is consistently around 0.5. I carefully compared my outputs against the expected outputs and couldn't find any problems, so I submitted the assignment, hoping the grader would identify which section of my code was incorrect. However, the grader gave me 100/100 and said all functions are correct. Should I really expect the test accuracy to be between 80% and 95%, and if so, where should I look for the problem?

I get 80% on the test set when I run my notebook.

What loss and accuracy do you get from the cell with the model.fit() results?

Please post the values from the last few iterations.

The final loss and accuracy are: 0.0644 and 0.9848.

Those are the training set results; they look fine.
I'm not sure why you'd have that much lower accuracy on the test set.
I’ll think about it for a bit, and reply again if I have any ideas.


I got a 0.625 test accuracy, but training accuracy also looks totally fine.

Same issue here, test accuracy between 65% and 75%:

2/2 [==============================] - 0s 3ms/step - loss: 1.4789 - accuracy: 0.6607

Test accuracy =  0.6607142686843872

Train accuracy gets up to 100%.

The model passes both the test and the autograder, so I can assume it’s correct. I did set the embedding layer as non-trainable.

Any ideas on what could be wrong?
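For anyone unsure what "set the embedding layer as non-trainable" looks like in practice, here is a minimal sketch of a frozen Keras Embedding layer. The toy sizes and the zero-filled weight matrix are placeholders for illustration; the actual notebook builds the matrix from the pretrained GloVe vectors and uses the full vocabulary.

```python
import numpy as np
from tensorflow.keras.layers import Embedding

# Toy sizes for illustration; the assignment uses the GloVe vocabulary
# and 50-dimensional vectors.
vocab_size, emb_dim = 10, 50
emb_matrix = np.zeros((vocab_size, emb_dim))  # stand-in for GloVe weights

# trainable=False freezes the layer, so the pretrained vectors are
# not updated during model.fit().
embedding_layer = Embedding(vocab_size, emb_dim, trainable=False)
embedding_layer.build((None,))                # create weights before setting them
embedding_layer.set_weights([emb_matrix])
```

With the layer frozen, only the downstream LSTM and dense layers learn during training, which is the intended setup for this assignment.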

Found the issue: the test dataset CSV file was mishandled at some point, and the strings contain a \t (TAB character) at the end. The training set does not have this problem.

I used .split(' ') to separate the words, which meant the last word of each sentence had a \t stuck to it and was not in the dictionary. The solution is to always call .split() with no argument, which splits on any whitespace and strips it by default.
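A quick illustration of the difference, using a hypothetical sentence with the trailing tab described above:

```python
# A test-set string with a stray tab at the end, as described above.
sentence = "I am so happy\t"

# .split(' ') splits only on single space characters, so the trailing
# tab stays attached to the last word, which then misses the word index.
words_bad = sentence.split(' ')   # ['I', 'am', 'so', 'happy\t']

# .split() with no argument splits on any run of whitespace and ignores
# leading/trailing whitespace, so the tab disappears.
words_good = sentence.split()     # ['I', 'am', 'so', 'happy']
```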

With correctly split words, I'm getting 82% accuracy, which is within the expected range, but surprisingly still below the much simpler average-vector model.


I came here with the same problem: I got 78% test accuracy. Maybe this is overfitting due to the use of embeddings?

Okay, I just ran the notebook again and now I get 85% accuracy on the test set. Maybe the model has a very high variance problem.