Have I correctly understood the Naïve Bayes inference formula?

The lecture titled ‘Log Likelihood, Part 1’ mentions the following:
[image: the formula from the lecture, with the leading ratio marked in a red rectangular box]
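(In case the image does not load: transcribing the slide from memory, the formula in question should be the Naïve Bayes inference rule below, where $m$ is the number of words in the tweet, and the red box is around the leading ratio.)

$$\frac{P(pos)}{P(neg)} \prod_{i=1}^{m} \frac{P(w_i \mid pos)}{P(w_i \mid neg)} > 1$$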
As I understand it, this part of the formula (the part in the red box) adjusts for the effect of the total amount of positive and negative vocabulary only to a certain extent, rather than eliminating it entirely. To eliminate that effect completely, one would need to raise the term to the power of m.
[image: my proposed modification of the formula]
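(Again in case the image does not load: what I am proposing would look something like the leading ratio raised to the power of $m$.)

$$\left(\frac{P(pos)}{P(neg)}\right)^{m} \prod_{i=1}^{m} \frac{P(w_i \mid pos)}{P(w_i \mid neg)}$$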

Which course are you referring to? You posted in “AI Discussions”, which isn’t about any specific course.

I have moved it to the relevant category.

Natural Language Processing with Classification and Vector Spaces

Ok, thanks!

Hi @Micheal_Anderson,

Would you like to have a discussion with someone who is not from this course?

I have a different understanding of this formula: the ratio you put in the red rectangular box is not something you can optionally eliminate. In fact, it is the very thing that makes it Bayes. In other words, whenever you don’t see that ratio in a Bayesian context, it is only because it happens to equal 1.
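For example, with a perfectly balanced training set,

$$\frac{P(pos)}{P(neg)} = \frac{0.5}{0.5} = 1,$$

so the ratio silently disappears from the product, but it is still part of the formula.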

What this lecture seems to be looking for is the probability that the sentiment is positive given the words, which is then compared with the probability that the sentiment is negative given the same set of words.

So you are literally just constructing these two probabilities. Below is an example that uses both Bayes’ rule and the independence assumption, which is the one thing that makes it “Naive” (together, you have the Naive Bayes :wink: ).
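(A sketch of that example, in case the image does not render: first apply Bayes’ rule, then the independence assumption.)

$$P(pos \mid w_1, \dots, w_m) = \frac{P(w_1, \dots, w_m \mid pos)\, P(pos)}{P(w_1, \dots, w_m)} \approx \frac{P(pos) \prod_{i=1}^{m} P(w_i \mid pos)}{P(w_1, \dots, w_m)}$$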

You have this formula for the case of sentiment = pos, and you get another formula in the same way for the case of sentiment = neg. Divide the positive case by the negative case, and you get the formula that says: if the ratio is larger than 1, the probability that the sentiment is positive given the words is the larger one.
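Concretely, the evidence term $P(w_1, \dots, w_m)$ cancels in the division, leaving

$$\frac{P(pos \mid w_1, \dots, w_m)}{P(neg \mid w_1, \dots, w_m)} = \frac{P(pos)}{P(neg)} \prod_{i=1}^{m} \frac{P(w_i \mid pos)}{P(w_i \mid neg)},$$

and you predict positive whenever this ratio is larger than 1. Note how the leading ratio survives the division: it is exactly the term in your red rectangular box.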

Cheers,
Raymond


@Micheal_Anderson, rather than asking how to eliminate it, what do you think the term in your red rectangular box contributes, given that it is indeed the “Prior” term of Bayes’ rule?

Thank you very much! Now I realize that I missed this crucial parameter in the initial step of the Bayes calculation process.


You are welcome, @Micheal_Anderson!

You said:

> …this part of the formula adjusts for the effect of the total amount of positive and negative vocabulary to a certain extent, rather than eliminating it entirely.
If you are interested, this is actually a good chance to review that statement, now that we know the part comes from the “Prior” term of Bayes’ rule. For example, we know the prior should have nothing to do with the observational data that we put into the “likelihood” term, because that is what the name “prior” means: “prior to the observational data”. While your choice of the word “adjusts” is still a very nice capture of what the prior can do, more can be elaborated from there. :wink:
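If it helps, here is a minimal sketch with made-up numbers (none of them come from the course or its labs) showing how the prior can tip a decision even when the words alone lean the other way:

```python
from math import prod

# Hypothetical class counts in a training set -> the "prior" ratio.
# These numbers are invented purely for illustration.
n_pos, n_neg = 70, 30
prior_ratio = n_pos / n_neg                 # P(pos)/P(neg) ~= 2.33

# Hypothetical per-word ratios P(w_i|pos)/P(w_i|neg) for one tweet:
word_ratios = [0.8, 0.9, 1.1]

likelihood_ratio = prod(word_ratios)        # ~= 0.79: the words alone lean negative
score = prior_ratio * likelihood_ratio      # ~= 1.85: the prior tips it to positive

print("positive" if score > 1 else "negative")  # -> positive
```

Notice that the words only enter through the likelihood term; the prior was fixed before this tweet was ever observed.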

But this is totally optional. I’m glad to know that you have found something in my last reply.

Cheers,
Raymond
