C3_W1_Anomaly_Detection Questions

jaejun02 · May 29, 2024, 2:06pm

Hello, I have few simple questions on anomaly detection algorithm.

First of all, the p(x) that is describe in the lecture, I believe it is not exactly “probability,” right? It is the probability density which might exceed 1 if the variance is extremely small. Or am I getting something wrong?

Secondly, what should do if I have a categorical variable that looks really good in detecting anomaly? I can’t fit a gaussian distribution on a discrete random variable, right?

Thanks!

TMosh · May 29, 2024, 5:31pm

I think you’re correct, Andrew is equally likely to use the words “probability” or “probability density”. He’s not a math lecturer.

I recommend you do an internet search for:
“anomaly detection with categorical features”.

There is a lot of literature.

rmwkwok · June 1, 2024, 12:25am

Hello @jaejun02,

I think this moment of the lecture has presented to us the critical idea that each independent feature contributes one probability density. Although they are all later assumed to be Gaussians, we can have a Multinormial if there is a categorical feature or a Binomial for boolean feature. Then, the rest of the algorithm should work the same way.

Cheers,
Raymond

Topic		Replies	Views
Categorical variables in anomaly detection Unsupervised Learning, Recommenders, Reinforcement week-module-1	4	683	September 22, 2022
Anomaly Detection Improvement Issues Unsupervised Learning, Recommenders, Reinforcement week-module-1	12	577	July 9, 2023
C3_W1_Anomaly Detection_Feature_Distribution Unsupervised Learning, Recommenders, Reinforcement week-module-1	1	512	March 5, 2023
Probability for anomaly detection Unsupervised Learning, Recommenders, Reinforcement week-module-1	14	1001	August 9, 2022
Calculation of p(x) in Anomaly Detection Unsupervised Learning, Recommenders, Reinforcement week-module-1	1	514	December 29, 2022

C3_W1_Anomaly_Detection Questions

Related topics