In DLS/C3/W1, when talking about human-level performance, it is assumed that we can measure it against some gold-standard, 100%-accurate reference (e.g., for the analysis of X-rays by radiologists).
In real life, where do we get that perfectly classified dataset against which we can then measure human-level performance (and the performance of our own code)?
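
For concreteness, here is a minimal sketch (in Python, with made-up arrays) of what I mean. Both error rates below are only computable relative to `y_true`, and that reference array is exactly what I don't know how to obtain in practice:

```python
import numpy as np

# Hypothetical example: y_true is the assumed gold-standard labeling.
# The question is where such labels would come from in the real world.
y_true  = np.array([1, 0, 1, 1, 0, 1, 0, 0])  # assumed 100%-accurate labels
y_human = np.array([1, 0, 1, 0, 0, 1, 0, 1])  # e.g., a radiologist's reads
y_model = np.array([1, 0, 1, 1, 0, 0, 0, 0])  # our classifier's predictions

# Error rates are defined relative to the gold standard.
human_error = np.mean(y_human != y_true)
model_error = np.mean(y_model != y_true)
print(f"human error: {human_error:.2%}, model error: {model_error:.2%}")
```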
