Incorrect Labelled Data

Anbu · September 7, 2021, 5:22pm

Hi Sir,

We had couple of doubts from the lecture Cleaning Up Incorrectly. Can you please help to clarify sir?

Statement 1: In the below pic, if we did sum up 8% + 43% + 61% + 6%, the total should come to 100% but not coming in this case . What could be the reason behind ?

Statement 2: But you don’t trust your dev set anymore to be correctly telling you whether this classifier is actually better than this because your 0.6% of these mistakes are due to incorrect labels.

My intuition about statement 2: Assume before fix incorrect label, classifier A 2% error better than classifier B 4% error evaluating against the dev set. But after fixing incorrect label in the dev set, now we will get classifier B 1% error than classifier A 2% error. Is it due to the reason we should not trust dev set ?

Adam_Hjerpe · October 23, 2021, 8:24am

Hi, regarding statement 1: Since a picture can be both Blurry and contain a Great Cat it will not sum to 100%.

Topic		Replies	Views
Overall dev set error after fixing incorrectly labeled data Structuring Machine Learning Projects	4	354	October 26, 2023
Is special error not summing to overal errors? Structuring Machine Learning Projects	5	531	May 16, 2023
Course 3 Week 2 - Cleaning Up Incorrectly Labeled Data Structuring Machine Learning Projects	1	524	October 7, 2022
Cleaning Up Incorrectly Labeled Data - ML Strategy \| Coursera Structuring Machine Learning Projects week-2	4	232	April 11, 2024
Cleaning Up Incorrectly Labeled Data Structuring Machine Learning Projects	2	561	October 10, 2022

Incorrect Labelled Data

Related topics