Log_prob in sampling_decode function

lbaiao · March 1, 2023, 11:22am

The log_prob variable returned by the sampling_decode function is the log probability of the last symbol. Why is it considered to be the log probability of the whole sentence?

arvyzukai · March 1, 2023, 2:26pm

That is a very good question and I think you found a mistake I will report it for fixing

lbaiao · March 1, 2023, 7:16pm

Out of curiosity, would the sentence’s probability be the product of the symbols’ probabilities?

arvyzukai · March 2, 2023, 7:04am

Yes, sentence probability would be the product of the symbols’ probabilities, but since we have log probabilities in that assignment, the sentence log probability should be the sum of symbols’ log probabilities.

As you know from the lectures, this is just a part of the picture, since short sentences would have bigger probabilities and people came up with ideas how to account for that.

So it is part of design questions how do you want to go about that, but for sure, the sampling_decode should not output the last symbol’s probability.

Topic		Replies	Views
To get the last token from the log probabilities NLP with Attention Models week-1	4	565	April 22, 2023
C4W1 - stuck in Ex6 NLP with Attention Models week-1	12	846	May 30, 2023
Sampling_decode Assignement function NLP with Attention Models week-1	1	561	July 1, 2022
In C10 all my translated sentences have double .. or ! at the end NLP with Attention Models week-1	6	426	November 3, 2023
Sampling_decode stuck in loop NLP with Attention Models week-1	2	586	July 7, 2023

Log_prob in sampling_decode function

Related topics