Discussion of real-number evaluation

tbhaxor · February 27, 2023, 5:21am

So prof Ng said that changing the parameters of the system is faster when it is being developed, for example check supervised learning model.

Is he talking about supervised learning, because when I fast forwarded, he introduced labels in anomaly detection

rmwkwok · February 27, 2023, 6:05am

Please share around what time in which video did you quote them from.

tbhaxor · February 27, 2023, 9:28am

This video, 0:00 - 2:00 mins please watch between this timeframe

tbhaxor · February 27, 2023, 10:49am

The real number evaluation, is he talking about loss function?

tbhaxor · February 27, 2023, 11:40am

Oh wait, he explained here

rmwkwok · February 27, 2023, 10:46pm

Hello @tbhaxor,

I think you have found the answer yourself.

If we can evaluate, we can make new decision based on evaluation results. However, we can’t evaluate without any label, right? We need labels to tell whether the predictions are good or bad, right? However, we are talking about the anomaly detection in unsupervised manner, so how can we have labels?

It might seem contradictive but the key is for us to not go to the extreme that we have abolutely no labels at all. Instead we can still try to collect a few just for the purpose of evaluation. We don’t really have labels for all samples, and we don’t use the labels in the training process which still makes it an unsupervised learning.

We need a function for the evaluation. It can be the loss function, or it can be any other metric function of your choice. The loss function is critical in the training process, but we don’t have to limit ourselves to the same loss function at evaluation.

Cheers,
Raymond

tbhaxor · February 28, 2023, 8:21am

Yes, infact in clustering where we dont take any CV still it converges better. mean distance between centroid and data point is the real-number evaluation done there which is then compared with previous iteraction.

So without real-number eval, I dont see how model can converge

tbhaxor · February 28, 2023, 8:23am

Actually I dont think it is contradictory, because still model is learning from data that does not have label. But CV was label. So \epsilon value is indirectly influenced by the CV.

tbhaxor · February 28, 2023, 8:25am

Makes sense, One question on this. Do you think F1 score is loss function or just a metric function for evaluation?

TMosh · February 28, 2023, 9:16am

It’s a useful statistical matric.

rmwkwok · February 28, 2023, 10:38am

We also want the loss function to be differentiable. We need to compute gradients.

tbhaxor · February 28, 2023, 11:40am

Yeah but f1 score is not differentiable

https://towardsdatascience.com/the-unknown-benefits-of-using-a-soft-f1-loss-in-classification-systems-753902c0105

rmwkwok · February 28, 2023, 12:48pm

so I agree with Tom that it’s a useful metric.

TMosh · February 28, 2023, 5:50pm

That’s why we don’t use it as a loss function.

Topic		Replies	Views
Finding unusual events example: Why unlabeled data Unsupervised Learning, Recommenders, Reinforcement week-1	1	430	July 13, 2023
Anomaly Detection vs Supervised Learning Unsupervised Learning, Recommenders, Reinforcement week-1	2	382	May 15, 2024
C3_W1_Anomaly_Detection - exercise 2 question Unsupervised Learning, Recommenders, Reinforcement week-1	1	548	June 20, 2023
Many outliers vs real data Unsupervised Learning, Recommenders, Reinforcement week-1	2	430	June 7, 2023
Is collaborative filtering still Unsupervised? Unsupervised Learning, Recommenders, Reinforcement week-2	7	881	January 20, 2023

Discussion of real-number evaluation

Related topics