C1_W2_Assignment key error

jfeller35 · December 1, 2024, 6:27pm

I keep running across a key error when running the train_naive_bayes() function.

Don’t know if you can see the image, but it seems like it cannot find the value
(‘noo’, 1) but (‘noo’,0) will be found in freq dictionary. It seems to me like in the count_tweets() function that creates the dictionary we should be defining both (word,1) and (word,0) in the dictionary when a new word is found (in this case ‘noo’) even if we set one of those equal to zero. But when I try that the unit test for that part of the assignment fail.

My workaround was to create a try/except block to assign freq_pos/freq_neg to zero when this KeyError is thrown, but then the unit tests fail saying wrong value for log likelihood dictionary.

paulinpaloalto · December 1, 2024, 6:29pm

Not every word will have both sentiment values. Your code needs to handle that. The “get()” method on a dictionary is a nice clean way to deal with potentially missing keys. Or you can use “if” clauses.

It looks like that’s not the only issue in your code, e.g. why are you dividing the frequencies by D_pos and D_neg? Where does it say to do that?

jfeller35 · December 1, 2024, 6:42pm

Ah yes I was not aware of the get() method for python dictionaries. That is very useful. And yes I was thinking of a D_pos and D_neg were probabilities when they are just counts. But seems like the same unit tests are still failing even after I correct for that.

jfeller35 · December 1, 2024, 6:51pm

Okay I figured it out. I was using D_pos and D_neg in the log likelihoods instead of N_pos and N_neg. Need to pay attention to the differences in those counts. Thank you for your quick response.

Topic		Replies	Views
Train_naive_bayes NLP with Classification and Vector Spaces week-module-2 , week-module-3	9	670	June 27, 2023
Wrong number of keys in loglikelihood dictionary NLP with Classification and Vector Spaces week-module-2 , week-module-3	5	661	December 8, 2022
C1_W2_Assignment_Exercise 2 - train_naive_bayes NLP with Classification and Vector Spaces week-module-2	18	465	April 20, 2024
C1_W2_Assignment loglikelihood wrong value NLP with Classification and Vector Spaces week-module-2 , assignment	28	215	August 27, 2025
Part 2: Train Naive Bayes (failing check) NLP with Classification and Vector Spaces week-module-2 , week-module-3	4	780	March 22, 2024

C1_W2_Assignment key error

Related topics