C3_W1 definition of probability

Elzear_Young · December 30, 2022, 11:16pm

In the chapter about anomaly detection, where Prof. Ng talks about the Gaussian distribution, what exactly does he mean by “the probability of x”?

When I hear the phrase “the probability of x” I’d usually think of the probability that a random number, according to the given probability distribution, is equal to x. In the case of the normal distribution this would always be equal to 0, which isn’t helpful.

In this context, he just seems to plug x into a density function of a probability distribution, and call it the probability of x. How does this make sense? And why would that be useful?

shanup · December 30, 2022, 11:28pm

Hello @Elzear_Young

By looking at the probability distribution of X, we are able to assess if a particular value of X is highly probable or not.

As a simple example: if a value of x is highly probable (i.e., such a value of x happens many times and is not a rare occurrence), then we don’t need to consider it an anomaly. However, if a certain value of x is not that probable (i.e., such a value of x is a rare occurrence), then the chances are higher that it represents an anomaly.
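
To make this concrete, here is a minimal sketch in Python of the idea above, assuming a single feature modeled by a Gaussian fitted to the training data. The names `gaussian_pdf` and `flag_anomalies`, the sample values, and the threshold `epsilon` are illustrative assumptions, not taken from the course material.

```python
import numpy as np

def gaussian_pdf(x, mu, sigma2):
    """Density of a univariate Gaussian with mean mu and variance sigma2."""
    return np.exp(-(x - mu) ** 2 / (2.0 * sigma2)) / np.sqrt(2.0 * np.pi * sigma2)

def flag_anomalies(x_train, x_new, epsilon):
    """Fit mu and sigma^2 on x_train, then flag x_new values with density below epsilon."""
    mu = np.mean(x_train)
    sigma2 = np.var(x_train)  # maximum-likelihood variance (divides by the sample count)
    p = gaussian_pdf(np.asarray(x_new, dtype=float), mu, sigma2)
    return p, p < epsilon

# Hypothetical measurements clustered around 10, plus two new points to score.
x_train = np.array([9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3])
x_new = np.array([10.05, 12.5])

# epsilon is an assumed threshold; in practice it would be tuned on labeled validation data.
p, is_anomaly = flag_anomalies(x_train, x_new, epsilon=0.01)
print(p, is_anomaly)
```

In this sketch the density at the first new point comes out larger than 1, which is perfectly fine for a density value. Only the comparison against `epsilon` matters: the point near the bulk of the data is kept, and the point far in the tail is flagged.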

rmwkwok · December 31, 2022, 12:54am

Hello @Elzear_Young,

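I think @shanup has explained it well. However, I was wondering why you said the following:

“In the case of the normal distribution this would always be equal to 0, which isn’t helpful.”

Why would it always be zero? The normal distribution isn’t zero everywhere, so why would it always be zero?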

Raymond

Elzear_Young · December 31, 2022, 10:38am

@rmwkwok because the probabilities of all points have to add up to 1, but the normal distribution is given on uncountably infinitely many points, so they can’t have positive probabilities (or they would add up to infinity).

rmwkwok · December 31, 2022, 11:10am

Ah, so I think, @Elzear_Young, you are arguing that the normal distribution is described by a probability density function, so unless we talk about a region of x, we can’t integrate over it to get a probability mass.
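
A small numerical sketch may help show the link between the density and an actual probability. It assumes a standard normal (mean 0, standard deviation 1), and the helper names `normal_pdf`, `normal_cdf`, and `interval_prob` are hypothetical. The probability of landing in a narrow window around x is approximately p(x) times the window width, so it shrinks toward zero as the window shrinks, while the density value itself stays fixed.

```python
import math

def normal_pdf(x, mu=0.0, sigma=1.0):
    """Gaussian density evaluated at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def normal_cdf(x, mu=0.0, sigma=1.0):
    """Gaussian cumulative distribution function, written with the error function."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def interval_prob(x, width, mu=0.0, sigma=1.0):
    """Probability that a draw falls in [x - width/2, x + width/2]."""
    half = width / 2.0
    return normal_cdf(x + half, mu, sigma) - normal_cdf(x - half, mu, sigma)

# Compare the exact window probability with the approximation p(x) * width
# at a typical point (x = 0) and at a point in the tail (x = 3).
for width in (1.0, 0.1, 0.01):
    for x in (0.0, 3.0):
        prob = interval_prob(x, width)
        approx = normal_pdf(x) * width
        print(f"x={x}, width={width}: P={prob:.6f}, p(x)*width={approx:.6f}")
```

For any fixed small width, the ratio of the two window probabilities is roughly the ratio of the two densities. That is why comparing p(x) values across points is meaningful even though each single point has probability zero.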

We actually had that discussion before, and I would highly suggest that you go through it or just read my conclusion. This link will bring you to my conclusion of the discussion, but please start from the first post of the thread if you prefer. It would take some time to read the whole thread, but it would be worth it if it covers all your concerns.

Cheers,
Raymond
