Question on Top P Sampling ( Generative Configuration)

rukshanj · July 25, 2023, 12:01pm

Its mentioned when p=0.3, it will pick the; 0.1 and 0.2 (because they add up to .3).

What if probabilities are as eg:

apple 0.28
banana 0.10
cake 0.02
etc.

If its top p = 0.3;
would it pick all three?

as apple + banana = 0.38 > 0.3 as well as
apple + cake = 0.3

gent.spah · July 25, 2023, 1:24pm

Probably not, it will pick only the 2.

Kyle_Evans · August 2, 2023, 6:44pm

Per my reading, the top p picks the one with cumulative probability greater or equal to the top p value. so in this case, 0.38 will be chosen because it is greater than the top p value(0.38>0.3)

aakashs07 · October 3, 2023, 11:44am

Based on my reading from the following resources:

I also think it should be first two tokens - apple + banana. The top_p parameter sets the threshold on cumulative probability of tokens so that we can restrict tokens from being considered for next token prediction.

nen · November 1, 2023, 6:01pm

I think there is a mistake in that video.

Top-k sampling refers to selecting the next token randomly from a specified number, k, of tokens with the highest probabilities.

Top-p sampling refers to selecting the next token randomly from the smallest set of tokens for which the cumulative probability exceeds (or is equal) a specified value, p. and that’s why 0.1 and 0.2 where selected

macandcheese · June 30, 2024, 4:50am

The video says the cumulative probability should be <=p, but by logic >=p makes more sense. Because we want the o/p to be creative, but we are limiting its randomness by putting probability constraints. so >=p should be correct right?

Topic		Replies	Views
Both Top P And Top K non zero? How does the model choose Generative AI with Large Language Models week-1	1	498	June 29, 2023
Multiple sets with same probabilities in the output of softmax? Generative AI with Large Language Models week-1	2	324	October 17, 2023
Lab 1 - Fine Tuning Generative AI with Large Language Models week-1	2	492	July 17, 2023
Sampling novel sequence using np.random.choice Sequence Models coursera-platform	4	482	June 2, 2023
Week 1 Sampling Novel Sequences Sequence Models coursera-platform	4	595	June 20, 2024

Question on Top P Sampling ( Generative Configuration)

Related topics