What detects the anomalies for testing the anomaly detector?

Andrei_Landon · January 25, 2024, 6:58pm

In the last few videos on anomaly detection it’s mentioned that tuning & testing an anomaly detector requires the test set or cross-validation set to have some labeled anomalies in it. If anomalies are so rare then how are they found so that they can be labeled? Obviously the anomaly detector itself can’t be used to find them; that would be cheating. The only way I can think of is to use anomalies that made themselves known by their real-world consequences, like the last transactions on a credit card before it got reported stolen or the manufacturer’s old testing records on a jet engine that caught fire. But is that really how it’s done? That way of collecting data seems like it would introduce bias - not to mention that it’s less than ideal to leave the anomaly detector untested until something goes horribly wrong.

TMosh · January 25, 2024, 7:04pm

One could use a detailed and expensive expert investigation of a small population to discover the anomalies.

Evidence from past performance or failures could also be used.

Yes, that’s one method.

Topic		Replies	Views
Anomaly Detection Practice Lab - labeled data Unsupervised Learning, Recommenders, Reinforcement week-module-1	1	499	February 18, 2023
Anomaly algorithm - video difference Unsupervised Learning, Recommenders, Reinforcement week-module-1	6	59	July 10, 2024
Anomaly Detection vs Supervised Learning Unsupervised Learning, Recommenders, Reinforcement week-module-1	11	635	August 11, 2025
Anomaly detection - tune X_j Unsupervised Learning, Recommenders, Reinforcement week-module-1	6	295	March 5, 2024
Labeled data in Anamoly Detection Unsupervised Learning, Recommenders, Reinforcement week-module-1	1	55	October 30, 2025

What detects the anomalies for testing the anomaly detector?

Related topics