Hello,
Can someone offer an explanation (preferably proof) as to why the 2nd term in the denominator should be “V_class”, which is the number of unique words in the vocabulary ?
Here’s the equation -
P(w_i/class) = (freq(w_i, class) + 1)/(N_class + V_class)
Any insight would be helpful.
Thanks
Ani