Part 2: Train Naive Bayes (failing check)

Sapiens · January 7, 2022, 2:12pm

This is what I entered:

{moderator edit - solution code removed}

The output is identical to what’s expected:

0.0
9165

But when I run the next check

# Test your function
w2_unittest.test_train_naive_bayes(train_naive_bayes, freqs, train_x, train_y)

I get the following error:

Wrong number of keys in loglikelihood dictionary. 
	Expected: 9165.
	Got: 148.

---------------------------------------------------------------------------
KeyError                                  Traceback (most recent call last)
<ipython-input-16-ca55b160668b> in <module>
      1 # Test your function
----> 2 w2_unittest.test_train_naive_bayes(train_naive_bayes, freqs, train_x, train_y)

~/work/w2_unittest.py in test_train_naive_bayes(target, freqs, train_x, train_y)
    369         for key, value in test_case["expected"]["loglikelihood"].items():
    370 
--> 371             if np.isclose(result2[key], value):
    372                 count_good += 1
    373 

KeyError: 'sunglass'

What am I missing?

arvyzukai · January 12, 2022, 4:11pm

Hi, Sapiens.

I think the problem with your code is because you construct your vocab from tweets (through simplified) and not from freqs dictionary as suggested in the instructions:

You can then compute the number of unique words that appear in the freqs dictionary to get your 𝑉 (you can use the set function)

I am guessing your code runs into memory constraints when the unittest is run.

parrotox · February 12, 2022, 9:41pm

I’m having tremendous difficulty figuring out how to get vocab. I’m pretty sure I understand the idea, but can’t figure out how to make the code do it. I think I’m supposed to get all the words without the 0 or 1, right?
Like if we have:
(‘word1’, 0), (‘word2’, 0)…etc.
Then my vocab would be:
‘word1’, ‘word2’…etc.,
right? How do i get Python do to that?

Edit: After working on it more, I think my previous assumption was wrong. I really need help understanding the vocab thing. What is that supposed to be?

Edit Again: I figured out a workaround that is clearly not what was meant for me to do, but it works, so I’m past that part of the assignment now. However, I’d still like to know how it was intended that I get vocab and V.

paulinpaloalto · February 13, 2022, 6:30pm

You are correct that you need to select the first element (the word) from each of the dictionary entries. Then you need to make them unique, since there are probably two entries for each word, right? They give you the hint of using the “set()” function in python. Check the documentation for that.

If you want to go “totally pythonic” here, you just feed the set function the output of a “list comprehension” that enumerates the first element of all the keys in the freqs dictionary. It’s just a single line of code.

Deepti_Prasad · March 22, 2024, 1:47pm

You are not suppose to share assignments codes on public post, it is against community guidelines.

Topic		Replies	Views
Wrong number of keys in loglikelihood dictionary NLP with Classification and Vector Spaces week-module-2 , week-module-3	5	648	December 8, 2022
Doubt in Week 2 coding assignments NLP with Classification and Vector Spaces week-module-2	9	108	October 22, 2024
C1_W2_Assignment loglikelihood wrong value NLP with Classification and Vector Spaces week-module-2 , assignment	24	142	June 26, 2025
Issue: Wrong number of keys in loglikelihood dictionary NLP with Classification and Vector Spaces week-module-2 , week-module-3	3	624	March 22, 2023
60/60 on assignment but trouble running on own machine due to NLTK bug NLP with Classification and Vector Spaces week-module-2 , week-module-3	8	757	February 24, 2022

Part 2: Train Naive Bayes (failing check)

Related topics