Difference between Anomaly detection and classification

Hi, I was watching the first video on anomaly detection in Week 1 of Course 3. I suddenly couldn't distinguish between the way anomaly detection and classification work when Andrew Ng was discussing the aircraft example.

In binary classification, we use features to determine whether a test example belongs to the positive class or the negative class. And here in anomaly detection as well, we use features to detect anomalies.

And another question: in what sense is anomaly detection unsupervised, given that it uses features just like a classification task does?

Hello @waze,

A learning algorithm is unsupervised when you don't give it any training labels. Both supervised and unsupervised learning use features.

In binary classification, the learning algorithm is given the labels, so it knows which samples are positive and which are negative. It attempts to draw a boundary in the feature space that best separates the positive samples from the negative ones.
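To make the contrast concrete, here is a minimal sketch of a supervised learner. The data and the nearest-centroid rule are made up for illustration (they are not the course's algorithm); the point is that the labels `y` must be supplied at training time.

```python
# A toy supervised classifier: labels are required to fit it.
# Data and the nearest-centroid rule are hypothetical, for illustration only.

def fit_centroids(samples, labels):
    """Average the feature vectors of each class (0 = negative, 1 = positive)."""
    sums = {0: [0.0, 0.0], 1: [0.0, 0.0]}
    counts = {0: 0, 1: 0}
    for x, y in zip(samples, labels):
        sums[y][0] += x[0]
        sums[y][1] += x[1]
        counts[y] += 1
    return {y: [s / counts[y] for s in sums[y]] for y in (0, 1)}

def predict(centroids, x):
    """Assign the class whose centroid is closest (squared Euclidean distance)."""
    def dist2(c):
        return (x[0] - c[0]) ** 2 + (x[1] - c[1]) ** 2
    return min(centroids, key=lambda y: dist2(centroids[y]))

X = [[1.0, 1.0], [1.2, 0.9], [5.0, 5.0], [5.1, 4.8]]
y = [0, 0, 1, 1]  # the labels -- this is what makes the task supervised
centroids = fit_centroids(X, y)
print(predict(centroids, [4.9, 5.2]))  # a point near the class-1 cluster
```

Without the `y` list, this learner could not be trained at all, which is exactly the ingredient anomaly detection does without.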

In our anomaly detection example, the algorithm does not know which samples are positive and which are negative. We assume that most samples are normal, and that normal samples share similar feature values, which means they tend to cluster together in a certain area of the feature space. In contrast, samples that lie far away from the majority are more likely to be different (anomalous).

Since this idea is about where in the feature space samples are most likely to cluster, it is a frequency problem, or a probabilistic problem: in an area where samples gather, you have a high probability of finding a sample.

So, we fit Gaussian distributions to model the probability of a sample showing up in each part of the feature space, where the mean parameters of the distributions indicate the most crowded spot. Up to this point, we have learned the model parameters (the means and the variances) from our data (the features) without knowing the labels in advance.
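The fitting step above can be sketched in a few lines. This is a minimal illustration with made-up feature values; no labels appear anywhere.

```python
# A minimal sketch of fitting per-feature Gaussian parameters from unlabeled data.
# The sample values below are hypothetical, for illustration only.

def fit_gaussian(samples):
    """Estimate the mean and variance of each feature -- no labels needed."""
    n = len(samples)
    d = len(samples[0])
    means = [sum(x[j] for x in samples) / n for j in range(d)]
    variances = [sum((x[j] - means[j]) ** 2 for x in samples) / n
                 for j in range(d)]
    return means, variances

# Unlabeled training set: two features per sample (e.g. heat and vibration).
X = [[3.0, 1.0], [3.2, 1.1], [2.9, 0.9], [3.1, 1.0]]
means, variances = fit_gaussian(X)
print(means)      # centre of the crowded region of the feature space
print(variances)  # spread of the data around that centre
```

Notice that the whole "training" is just computing means and variances of the features, which is why no labels are required.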

Given the learned parameters, we can tell, at each point in the feature space, the chance of seeing a sample there. If a sample falls in an area of high probability, it is considered normal. If it falls somewhere the Gaussian model considers very unlikely, it is flagged as anomalous.
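The scoring step can be sketched as below, assuming the independent-per-feature Gaussian model from the course. The parameter values and the threshold `epsilon` are made up for illustration.

```python
import math

# A minimal sketch of scoring with learned Gaussian parameters.
# Parameters and the epsilon threshold are hypothetical, for illustration only.

def gaussian_density(x, means, variances):
    """p(x) = product over features of the 1-D normal density."""
    p = 1.0
    for xj, mu, var in zip(x, means, variances):
        p *= math.exp(-(xj - mu) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)
    return p

means, variances = [3.05, 1.0], [0.0125, 0.005]  # e.g. from the fitting step
epsilon = 1e-3  # samples with probability below this are flagged as anomalous

typical_sample = [3.0, 1.0]   # close to the crowded region
odd_sample = [5.0, 0.2]       # far from the majority

print(gaussian_density(typical_sample, means, variances) < epsilon)  # False
print(gaussian_density(odd_sample, means, variances) < epsilon)      # True
```

A sample near the means gets a high density and passes; a far-away sample gets a vanishingly small density and is flagged, matching the intuition above.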



Thank you so much for your fantastic explanation @rmwkwok. I really appreciate the effort you put into all the details. I now have a clear idea.

You are welcome @waze. Enjoy Course 3 :slight_smile:
