Wrong values for loglikelihood dictionary

damianxchk · February 4, 2023, 12:56am

It was a stupid indentation issue. Sorry! I am now getting this output. Looks all good!

freq_pos for smile = 47
freq_neg for smile = 9
loglikelihood for smile = 1.5577981920239676
0.0
9165

paulinpaloalto · February 4, 2023, 12:59am

Yes, those values agree with mine. Hope the grader also agrees!

Shahryar_Akbar · February 8, 2023, 3:22pm

Wrong number of keys in loglikelihood dictionary.
Expected: 9165.
Got: 11436
Hey Paul, can you help me? I can’t seem to find my mistake; I am getting wrong values for loglikelihood.
This is my code:

{moderator edit - solution code removed}

paulinpaloalto · February 8, 2023, 4:15pm

The problem is the way you have defined the vocab. You can’t just take the set of freqs, because those are tuples and they are all unique. That’s why you end up with too many entries. You need the unique words, which are the first entry in each of the tuples.
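
To illustrate the point about tuple keys, here is a minimal sketch with a made-up dictionary. The words and counts are hypothetical, not taken from the assignment data, and it assumes the keys are (word, label) tuples as described above. It shows why the set of keys is larger than the set of unique words.

```python
# Hypothetical toy dictionary: keys are (word, label) tuples, values are counts.
toy_freqs = {
    ("happy", 1): 12,
    ("happy", 0): 3,
    ("sad", 0): 7,
    ("smile", 1): 5,
}

# Every (word, label) tuple is unique, so "happy" is counted twice here.
pairs = set(toy_freqs.keys())

# Taking only the first entry of each tuple gives the unique words.
unique_words = set(word for word, label in toy_freqs.keys())

print(len(pairs))         # 4
print(len(unique_words))  # 3
```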

Also note that you’ve got “order of operations” issues on your p_w_pos and p_w_neg calculations. Try the following and watch what happens:

m = 5.
x = 1./1. + m
y = 1./(1. + m)

If you’re expecting x and y to have the same value, you’re in for a nasty surprise.
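
For anyone who would rather see the result directly, here is the same snippet with print calls added. Nothing else is changed.

```python
m = 5.
x = 1./1. + m     # division binds tighter than addition: (1/1) + 5
y = 1./(1. + m)   # parentheses force the sum first: 1 / 6

print(x)  # 6.0
print(y)  # 0.16666666666666666
```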

Oh, sorry, I didn’t read every line: you’ve also made the really classic mistake which most of the posts earlier on this thread are about. You just add 1 for each entry in the freqs dictionary, instead of using the actual frequency.
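
The difference between counting entries and summing frequencies shows up in a toy example. The dictionary below is hypothetical, the names entries_pos and total_pos are only for illustration, and label 1 is assumed to mean positive.

```python
# Hypothetical toy dictionary: keys are (word, label) tuples, values are counts.
toy_freqs = {
    ("happy", 1): 12,
    ("smile", 1): 5,
    ("sad", 0): 7,
}

entries_pos = 0  # the mistake: adds 1 per dictionary entry
total_pos = 0    # sums the actual frequency stored for each entry

for (word, label), count in toy_freqs.items():
    if label == 1:
        entries_pos += 1
        total_pos += count

print(entries_pos)  # 2
print(total_pos)    # 17
```

The two numbers only agree when every word occurs exactly once, which is why the bug is easy to miss on small hand-made inputs.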

Shahryar_Akbar · February 8, 2023, 10:06pm

Thanks, I have corrected my vocab, but it is still showing 3 tests failed. I have tried for pair in freqs.keys():

{moderator edit - solution code removed}

but it is still not working. What is the problem in this code? It is getting the total frequency of positive and negative words, but 3 tests still fail. Why?

paulinpaloalto · February 8, 2023, 10:33pm

If the loop is an enumeration of value, I don’t see value used in the body of the loop. How is pair defined with your code written that way?

With the enumeration over pair in freqs.keys(), that loop body should work, but note that there are lots of other ways later in the code to go off the rails also. Did you fix the order of operations thing I pointed out?

Shahryar_Akbar · February 9, 2023, 8:59am

I have solved the problem. It was in freq_pos and freq_neg: the code was not counting the actual frequency of each word. Thank you so much for the time and effort, Paul.

msshhp · July 31, 2023, 11:24am

Thank you, this helps. After reviewing equations (4) and (5), I see that you are right. The numerator is a frequency, so the denominator should be the total frequency.
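
As a sanity check on that reading, the numbers posted in this thread can be plugged in by hand. The sketch below assumes equations (4) and (5) are the usual Laplace-smoothed estimates, (freq + 1) / (N + V), and that the loglikelihood is the natural log of their ratio. The values for V, N_pos and N_neg are the ones printed in a later post in this thread.

```python
import math

# Values reported in this thread for the word "smile".
freq_pos, freq_neg = 47, 9
N_pos, N_neg = 27547, 27152   # total frequencies, not numbers of entries
V = 9165                      # number of unique words in the vocabulary

# Assumed form of the smoothed probabilities (Laplace smoothing).
p_w_pos = (freq_pos + 1) / (N_pos + V)
p_w_neg = (freq_neg + 1) / (N_neg + V)

loglikelihood_smile = math.log(p_w_pos / p_w_neg)
print(round(loglikelihood_smile, 4))  # 1.5578
```

Under those assumptions the result agrees with the loglikelihood for smile posted at the top of the thread.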

Tithi_Sarkar · September 20, 2023, 6:08pm

Hi Paul,
I have added these print statements and got the same output as yours but it’s still failing 3 test cases.
if word == 'smile':
    print(V, D, D_pos, D_neg, N_pos, N_neg)
    print(f"freq_pos for smile = {freq_pos}")
    print(f"freq_neg for smile = {freq_neg}")
    print(f"loglikelihood for smile = {loglikelihood[word]}")

output :

9165 8000 4000.0 4000.0 27547 27152
freq_pos for smile = 47
freq_neg for smile = 9
loglikelihood for smile = 1.5577981920239676
0.0
9165

9165 8000 4000.0 4000.0 27547 27152
freq_pos for smile = 47
freq_neg for smile = 9
loglikelihood for smile = 1.5577981920239676
Wrong values for loglikelihood dictionary. Please check your implementation for the loglikelihood dictionary.
9165 20 10.0 10.0 27547 27152
freq_pos for smile = 47
freq_neg for smile = 9
loglikelihood for smile = 1.5577981920239676
Wrong values for loglikelihood dictionary. Please check your implementation for the loglikelihood dictionary.
9165 15 10.0 5.0 27547 27152
freq_pos for smile = 47
freq_neg for smile = 9
loglikelihood for smile = 1.5577981920239676
Wrong values for loglikelihood dictionary. Please check your implementation for the loglikelihood dictionary.
12 Tests passed
3 Tests failed

Not sure what I am missing. Thanks for all your posts.

reinoudbosch · September 30, 2023, 12:24pm

Hi Tithi_Sarkar,

Did you manage to resolve this issue? If not, feel free to send me your notebook as an attachment to a direct message so I can have a look at what is going on.

Elisa_Vera · September 30, 2023, 1:49pm

Hey, sorry to pile on, but I have a similar issue. I did the checks that Paul wrote and get the correct values for freq_pos and freq_neg for smile, as well as the correct values for N_pos and N_neg. However, my loglikelihood is still incorrect, and both my numerator and denominator use parentheses, so I don’t think there is an order of operations issue in p_w_pos and p_w_neg. Do you have any recommendations for other places to look?

reinoudbosch · September 30, 2023, 3:12pm

Hi Elisa_Vera,

I find it hard to say without looking at your code. Feel free to send me your notebook as an attachment to a direct message. I can then have a look.

reinoudbosch · September 30, 2023, 5:04pm

Hi Elisa_Vera,

Look carefully at the comment in train_naive_Bayes that states the following:
# calculate V, the number of unique words in the vocabulary
Can you see where the problem lies?

paulinpaloalto · September 30, 2023, 5:14pm

Thanks very much for watching this thread Reinoud! Sorry for my lack of response, but I was traveling the last 2 weeks and had a hard time keeping up on the forums.

Elisa_Vera · September 30, 2023, 5:15pm

*facepalm

Thank you!

reinoudbosch · September 30, 2023, 5:25pm

No facepalm needed; we all have bugs in our code. You are welcome.

reinoudbosch · September 30, 2023, 5:27pm

No problem Paul. I hope you had a good time traveling.

Guido_Carballo · November 1, 2023, 2:12am

LOL, well, @Vincent_Rupp, when I read your reply to @paulinpaloalto I skipped his explanation and tried to understand the sentence “N_pos isn’t the total positive words; it’s the total frequency for all positive words.” and got confused. So I started looking back and forth through the quiz text and suddenly caught the error, and now @paulinpaloalto’s explanation seems very easy to understand. As always, once you realize the error, you can’t understand how you overlooked it. As explained by @paulinpaloalto, the error is adding 1 instead of the number of times the word occurs in the tweets. Thank you both, because I spent all day today debugging for around 4 hours and couldn’t find the error.

Rishav_Kumar_Paramh1 · November 12, 2023, 9:07pm

Thanks @paulinpaloalto, I was committing the same mistake.

Kazeem_Enitan_Bello · January 29, 2024, 3:49pm

I seem to be missing something. I am getting V, which is len(vocab), as 9161 instead of 9165. I do not know where I made the mistake. Thanks in advance.
