Issue: Wrong number of keys in loglikelihood dictionary

CHEN_Yixi · March 22, 2023, 12:37pm

Hi,

I encountered an issue when I was trying to finish the train_naive_bayes function in C1_W2_Assignment. Error information is shown below. It showed that I got the wrong number of keys in the dictionary.

I supposed that the error may be caused by the wrong number of element in vocab. In my answer, vocab was gained from vocab = set([pair[0] for pair in freqs.keys()]), which got all unique words in the freqs dictionary.

I also tried to run vocab = set([pair[0] for pair in freqs.keys()]) in a new code cell, but its output’s length was still 9162 instead of 9165.

I wonder how this issue happened and how I can fix it.

Thanks!

ai_curious · March 22, 2023, 1:11pm

When I took this class a year ago I experimented with running locally. I had a different version of nltk. My output for the number of keys was 9162. My notes at the time say ‘issue is lower casing of some emoji’s as of NLTK 3.4.5’

If you search in the forum, you will find at least one other related thread. If you are running locally, you can try to match the NLTK version running in the Coursera (or Google?) cloud. If you are using the course-provided environment, this discrepancy might be caused by a package update that is breaking the unit test. HTH

CHEN_Yixi · March 22, 2023, 1:18pm

Thanks for your reply! I tried my code in the course-provided environment and the issue was fixed.

ai_curious · March 22, 2023, 5:41pm

Thanks for the feedback and glad you resolved it. For others reading this thread in the future, here is a related one with some more details and examples of what causes these numerical discrepancies…

Topic		Replies	Views
Wrong number of keys in loglikelihood dictionary NLP with Classification and Vector Spaces week-2 , week-3	5	647	December 8, 2022
C1_W2_Assignment loglikelihood wrong value NLP with Classification and Vector Spaces week-2 , assignment	15	76	February 24, 2025
Doubt in Week 2 coding assignments NLP with Classification and Vector Spaces week-2	9	91	October 22, 2024
Part 2: Train Naive Bayes (failing check) NLP with Classification and Vector Spaces week-2 , week-3	4	760	March 22, 2024
'Wrong values for loglikelihood dictionary.' Error on C1_W2 assignment NLP with Classification and Vector Spaces week-2 , week-3	6	690	May 17, 2023

Issue: Wrong number of keys in loglikelihood dictionary

Related topics