Difference between outlier and anomaly

Christian_Simonis · February 26, 2023, 12:59pm

For example a popular approach is that you can learn your normal behaviour as „normal cluster“ and if a certain data point is too far away from this cluster conclude it is an anomaly.

Autoencoders for example are a popular choice for anomaly detection or you have a sufficient amount of normal labels and the problem is suiting. Can you provide more details on your specific problem?

To differentiate you could e.g. check if the distribution assumptions are satisfied in total: e.g. if you are assuming a normal / Gaussian distribution, all normal data should follow this distribution including potential black swan events (i think you refer to them as statistical outliers) that only occur super rarely. After all the normal distribution is defined for an unlimited range. Sampling a very large, sufficient amount of representative data would make sure our true distribution will be approximated in a acceptable manner.

This thread might be worth a look, too: Anomaly Detection with Different Probability Distributions - #5 by Christian_Simonis

Best regards
Christian

Topic		Replies	Views
Many outliers vs real data Unsupervised Learning, Recommenders, Reinforcement week-1	2	430	June 7, 2023
Difference between Anomaly detection and classification Unsupervised Learning, Recommenders, Reinforcement week-1	3	579	July 28, 2022
Anomaly algorithm - video difference Unsupervised Learning, Recommenders, Reinforcement week-1	6	27	July 10, 2024
Anomaly detection - subpopulations , narrow normal distributions and false positives Unsupervised Learning, Recommenders, Reinforcement week-1	4	349	January 12, 2024
Anomaly Detection with Different Probability Distributions Unsupervised Learning, Recommenders, Reinforcement week-1	4	660	February 16, 2023

Difference between outlier and anomaly

Related topics