Alternative method for Anomaly Detection (Week 1)

shankha · February 28, 2025, 10:56pm

Dear Andrew,

I am a great fan of your courses and have followed many of them. Anomaly detection is the only topic so far where I felt that the Gaussian maximum-likelihood estimation method you suggest is not optimal and that there may be a better way.

Here is my suggestion:

Why not use kernel density estimates (KDE)?

Method: We estimate the probability distribution as follows:

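$$P(x) = \frac{1}{m} \sum_{i=1}^{m} \prod_{j=1}^{n} G\left(x_j;\, X_j^{(i)},\, \sigma\right)$$
where
G is the one-variable Gaussian density shown in the course, evaluated at feature x_j with mean X_j^{(i)} and standard deviation sigma
X^{(i)} is the i-th training example and X_j^{(i)} is its j-th feature
m is the number of training examples and n is the number of features.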
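
To make the formula concrete, here is a minimal sketch of how it could be computed, assuming NumPy and a single shared sigma for all features. The names `kde_density` and `is_anomaly` are made up for illustration and are not from the course.

```python
import numpy as np

def kde_density(x, X_train, sigma):
    """Product-Gaussian kernel density estimate at a single point.

    x:       shape (n,)    point to score
    X_train: shape (m, n)  training examples
    sigma:   scalar bandwidth shared by all features (an assumption)
    """
    x = np.asarray(x, dtype=float)
    X_train = np.asarray(X_train, dtype=float)
    # One-variable Gaussian for every training example i and feature j,
    # centred on the training value X_train[i, j]; result has shape (m, n).
    g = np.exp(-((x - X_train) ** 2) / (2 * sigma ** 2)) / (np.sqrt(2 * np.pi) * sigma)
    # Product over features, then average over the m training examples.
    return g.prod(axis=1).mean()

def is_anomaly(x, X_train, sigma, epsilon):
    # Same decision rule as in the course: flag x when its density is below epsilon.
    return kde_density(x, X_train, sigma) < epsilon
```

Both sigma and epsilon would still need to be chosen on a labelled cross-validation set, the same way epsilon is chosen in the course.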

Just as epsilon controls the trade-off between precision and recall, the parameter sigma in my suggestion also controls it: high sigma → low precision, high recall; low sigma → high precision, low recall.

Although KDE is not a proper statistical inference tool, we are not after the exact distribution of the training data set here.

The pros of my suggestion:

It can address correlated features.
There is no need to transform non-Gaussian features to make them Gaussian. That process is already very cumbersome when there are, for example, more than 20 features.
The course method cannot address multimodal (mixture) distributions, such as a double bell, while my suggestion can.
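
The double-bell point can be illustrated with a small sketch. It assumes NumPy and a made-up one-feature training set with two modes; `p_single`, `p_kde` and the bandwidth of 0.5 are illustrative choices only.

```python
import numpy as np

def gaussian(x, mu, sigma):
    # One-variable Gaussian density.
    return np.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (np.sqrt(2 * np.pi) * sigma)

# Made-up training set with two modes, around -5 and +5.
X = np.concatenate([np.linspace(-6, -4, 50), np.linspace(4, 6, 50)])

# Course-style fit: one Gaussian with the MLE mean and standard deviation.
mu, std = X.mean(), X.std()

def p_single(x):
    return gaussian(x, mu, std)

def p_kde(x, sigma=0.5):
    # Average of Gaussians centred on each training example.
    return gaussian(x, X, sigma).mean()

for x in (0.0, 5.0):
    print(f"x={x:+.1f}  single Gaussian={p_single(x):.4f}  KDE={p_kde(x):.4f}")
```

Here x = 0 lies in the empty gap between the two modes, yet the single Gaussian gives it a higher density than x = 5, which sits in the middle of a mode. The KDE gives x = 0 a density close to zero, so it would be flagged for any reasonable epsilon.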

I would be very grateful for your feedback, Andrew; it would help me gain a deeper understanding of anomaly detection.

Kind regards,
Shankha.

TMosh · March 1, 2025, 12:06am

Thanks for your suggestions.

Sorry, Andrew does not monitor the forums.
