Week 2 Assignment | Gumbel Distribution Use-Case

Hey Guys,
Towards the end of Week 2’s assignment, the Gumbel Distribution is introduced and used to predict characters. However, our model already produces its results as probability distributions over the entire vocabulary.

So, instead of using the function gumbel_sample, why don’t we simply pick the most likely character? And if we want to generate different sequences for the same prefix, we can sample characters from the probability distributions produced, i.e., each character is picked according to its probability.
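
For concreteness, here is roughly what I have in mind (a minimal numpy sketch; probs and the values in it are just placeholders for the model’s output distribution over the vocabulary):

```python
import numpy as np

# probs: the model's probability distribution over the vocabulary
# (hypothetical values, purely for illustration)
probs = np.array([0.5, 0.3, 0.15, 0.05])

# Option 1: always pick the single most likely character
greedy_idx = np.argmax(probs)

# Option 2: sample a character index according to its probability
sampled_idx = np.random.choice(len(probs), p=probs)
```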

I don’t understand the use case of the function gumbel_sample here. Does it produce better results in any way, and if so, how?

Cheers,
Elemento

Hey @Elemento

One extreme - always pick the most likely character.
In between - pick according to the probabilities from the model.
The other extreme - pick uniformly at random.

Temperature lets you pick between these extremes:

  • if you set temperature = 0, then you always pick the most likely character (the first case)
  • if you set temperature = 1, then you pick according to the model’s probabilities (the middle case)
  • if you set temperature = 10, then you pick almost uniformly at random (the last case)

You can try it by setting the parameter yourself.
What gumbel_sample does is adjust the probabilities used for the prediction.
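
To make the temperature bullets above concrete, here is a rough sketch of the idea behind gumbel_sample (not the assignment’s exact code, just the standard Gumbel-max trick with a temperature knob; log_probs stands for the model’s log-probabilities over the vocabulary):

```python
import numpy as np

def gumbel_sample_sketch(log_probs, temperature=1.0):
    """Pick an index via argmax over log-probs plus temperature-scaled Gumbel noise.

    temperature = 0   -> plain argmax (greedy, first case above)
    temperature = 1   -> samples according to the model's distribution
    temperature >> 1  -> noise dominates, picks become nearly uniform
    """
    u = np.random.uniform(low=1e-9, high=1.0 - 1e-9, size=log_probs.shape)
    gumbel_noise = -np.log(-np.log(u))  # standard Gumbel(0, 1) samples
    return int(np.argmax(log_probs + temperature * gumbel_noise))

# hypothetical log-probabilities over a 4-character vocabulary
log_probs = np.log(np.array([0.5, 0.3, 0.15, 0.05]))
print(gumbel_sample_sketch(log_probs, temperature=1.0))
```

The trick is that adding independent Gumbel(0, 1) noise to the log-probabilities and taking the argmax is equivalent to sampling from the softmax of those log-probabilities, so scaling the noise by the temperature is exactly what moves you between the extremes above.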

Cheers.
