Q9 previous_n_gram should be a list

gkouro · March 1, 2023, 12:15pm

I was trying the example of ‘’ after ‘cat’ with the function of q9 and I was getting a different probablity than doing it by hand. I realised that when you set previous_n_gram = tuple(previous_n_gram) and previous_n_gram is a string then the output is a tuple of all the characters. So it must be that previous_n_gram is always a list.
In the example in the next cell you try to estimate the probability of ‘cat’ after ‘a’ which inside the tuple() it output (‘a’,) anyway.
So it is better in the example to set ‘a’ in ‘[’,‘]’ like this:

tmp_prob = estimate_probability("cat", ["a"], unigram_counts, bigram_counts, len(unique_words), k=1)

Same in the following cell:

estimate_probabilities(["a"], unigram_counts, bigram_counts, unique_words, k=1)

Just saying

arvyzukai · March 1, 2023, 3:07pm

You’re correct - nice catch! It will be corrected

Topic		Replies	Views
C2_W3_Assignment Exercise 9 wrong? NLP with Probabilistic Models week-3	1	618	January 16, 2022
For the Exercise 9, anyone gets below answer? 0.1111 rather than 0.3333 NLP with Probabilistic Models week-3	4	607	September 19, 2022
Bug in inference for the assignment 3? NLP with Probabilistic Models week-3	5	501	May 1, 2023
Q10 - calculate_perplexity NLP with Probabilistic Models week-3	8	748	December 29, 2022
C2_W3_Assignment: Exercise 9 - estimate_probability NLP with Probabilistic Models week-3	3	319	February 9, 2024

Q9 previous_n_gram should be a list

Related topics