Examine mislabelled data

Chiang_Yuhan · March 14, 2024, 2:04am

Any strategies on how to manually identify mislabelled data?

I’m only doing a binary classification task with sources from two pools of 1-dimensional data.

Please leave your suggestions!

Thanks
Yuhan Chiang

TMosh · March 14, 2024, 4:07am

Find the index values there the prediction and the label are incorrect.
Inspect the examples that have those indices.

Chiang_Yuhan · March 14, 2024, 5:49am

I think in the dev (or validation set) in keras randomly shuffles the data every iteration, therefore I might only be able to do it on the test set. Is it customary to use it only on the train set?

Also, are there any functions that I could use to manually do this? I was using the model.evaluate function and it cannot map out all the examples. I arrange my examples into numpy arrays.

Thank you for your reply, I hope you can just give me a little more hints.

Yuhan

Topic		Replies	Views
Course 3 Week 2 - Cleaning Up Incorrectly Labeled Data Structuring Machine Learning Projects coursera-platform	1	524	October 7, 2022
Cleaning Up Incorrectly Labeled Data - ML Strategy \| Coursera Structuring Machine Learning Projects week-module-2 , coursera-platform	4	247	April 11, 2024
Week 2_error analysis Structuring Machine Learning Projects coursera-platform	2	573	May 25, 2021
Incorrect Labelled Data Structuring Machine Learning Projects coursera-platform	1	559	October 23, 2021
Confusion Matrix Accuracy Problem AI Discussions	71	628	November 4, 2023

Examine mislabelled data

Related topics