Gaussian distribution in anomoly detection

Numair · July 7, 2023, 6:20pm

I am confused how we are finding probabilities of all the different features. Correct me if I am wrong but from my understanding probability in a continuous distribution on any particular point is 0 and we have to find probabilities in certain ranges in continuous distributions. Can someone please explain this and also what the formula provided in the course(p(x)=1/2pi(e^-(x-u)sq/2*variance)) is doing?
Thanks in advance

TMosh · July 7, 2023, 6:41pm

We’re not using the probability, we’re using the distribution.

Christian_Simonis · July 7, 2023, 7:16pm

The probability is the outcome when you integrate mathematically over the probability density function (PDF). As you pointed out, taking only a point of this PDF does not really make sense since you cannot really interpret a single point here in a reasonable way (besides to be a literally infinitesimal small interval… as you pointed out, too!).

But if you integrate over a defined interval of the PDF you can derive a tangible probability, which you can interpret, e.g. in the example of an ROC analysis or when evaluating false positives etc., see also:

Please let me know if this helps, @Numair!

Best regards
Christian

Christian_Simonis · July 7, 2023, 7:24pm

I am not completely sure about the formula you wrote above. But I assume that you mean this one:

${isplaystyle {rac {1}{igma {qrt {2i }}}}e^{-{rac {1}{2}}eft({rac {x-u }{igma }}ight)^{2}}}$

It specifies the PDF of the popular (bell-shaped) normal distribution, also called Gaussian distribution, using the standard deviation σ and the mean μ.

(source)

see also this thread.

Please let me know if anything is open from your end, @Numair.

Best
Christian

rmwkwok · July 7, 2023, 8:24pm

Hello @Numair,

Just to add to the existing answers, it is true that at any single point, that formula (see the one shared in @Christian_Simonis for the complete form) does not give you a probability value, but a probably density value.

Consider the case where we actually care about the probability value, which is when we determine whether a feature value is beyond the anomaly threshold. If we are going serious about finding the probability value, we do integration over the probability density by integrating up the density from the threshold to infinity. However, knowing that there is a strictly decreasing relation between the probability density at a point x and the probability from x to \infty because the farther away x is beyond the threshold, the smaller the probability, it is sufficient for us to use the probability density to make that judgement of whether the feature is in the anomaly range.

It is of course very good and intuitive if we use the probability value for the judgement, but it is Okay too to use the probability density value for the judegement, because there is that strictly decreasing relationship between them.

Cheers,
Raymond

Topic		Replies	Views
C3_W1 definition of probability Unsupervised Learning, Recommenders, Reinforcement week-1	4	530	December 31, 2022
Probability for anomaly detection Unsupervised Learning, Recommenders, Reinforcement week-1	14	891	August 9, 2022
Understanding 𝐏(𝑥_𝑘∣𝐶_𝑖) = PDF_gaussian(𝑥_𝑘,𝜇_𝐶𝑖,𝜎_𝐶𝑖) in C3_W1_Assignment Probability & Statistics for Machine Learning &... week-1	5	462	July 10, 2023
Assignment Ex 6: PDF to calculate probabilities? Probability & Statistics for Machine Learning &... week-1	3	576	June 15, 2023
Calculation of p(x) in Anomaly Detection Unsupervised Learning, Recommenders, Reinforcement week-1	1	504	December 29, 2022

Gaussian distribution in anomoly detection

Related topics