Hi, I have trouble understanding the concept of perplexity. Namely, how does it capture language model “quality”? Do you have an intuitive explanation for this? And why is it said to be the same as entropy? Doesn’t entropy just measure randomness?
Thanks!
Hi, @Maxim_Afteniy!
Perplexity measures how well a probability model predicts a sample. In NLP, it is a commonly used metric for model evaluation.
Say that you have a test set of well-written sentences. If your model is good enough, it will assign those samples a high probability (low perplexity), which means it is not surprised, or perplexed, to see them.
About the second question, perplexity can also be defined as 2 raised to the cross-entropy:
PP(W)=2^{H(W)}=2^{-\frac{1}{N}\log_2{P(w_1, w_2, ..., w_N)}}
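In case it helps, here is a minimal Python sketch of that formula, using made-up per-token probabilities (the values in `token_probs` are purely for illustration, not from a real model):

```python
import math

# Hypothetical probabilities a language model assigns to each token of a
# test sentence, i.e. P(w_i | w_1 ... w_{i-1}). These values are made up.
token_probs = [0.2, 0.1, 0.25, 0.05]
N = len(token_probs)

# Cross-entropy H(W) = -(1/N) * log2 P(w_1, ..., w_N)
#                    = -(1/N) * sum_i log2 P(w_i | context)
cross_entropy = -sum(math.log2(p) for p in token_probs) / N

# Perplexity PP(W) = 2^{H(W)}
perplexity = 2 ** cross_entropy

print(f"cross-entropy: {cross_entropy:.3f} bits, perplexity: {perplexity:.2f}")
```

A model that assigns higher probabilities to the test tokens would produce a lower cross-entropy and therefore a lower perplexity.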
For a deeper dive, check this article.