SPOILER: Question about Quiz Item in Week 1

Scooter2000 · October 7, 2022, 9:49pm

I have a question about one of the quiz items in week 1.
Please DO NOT READ further if you have not taken the quiz.
Remember your honor!!!
.
.
.
.
.
.

Okay. The quiz asked True or False: The only way to acquire data for a supervised learning algorithm is to manually label it. I.e., given the input A, to ask a human to provide B.

I answered False, but the correct answer was True. Can someone explain this further? How could a supervised learning algorithm extract insights from data that isn’t labeled by a human? Thank you!

vignesh18 · October 8, 2022, 5:24am

@Scooter2000 Welcome to the community.

A supervised learning algorithm requires you to provide a set of inputs and the corresponding output(s) for each training example. It’s the presence of those outputs that makes it supervised learning, not whether they were labeled by humans. An example would be estimating housing prices based on the size of the house, number of floors, etc., where the training data doesn’t need to be labeled by humans.
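To make the housing example concrete, here is a minimal sketch in plain Python. Everything in it is an assumption for illustration: the `sales_records` values are made up, and `fit_line` and `predict` are hypothetical helper names, not part of any library. The point is only that the output B (the price) comes from a recorded sale rather than from a person annotating each row.

```python
# Hypothetical sales records: (size in square feet, recorded sale price).
# The price is the output B. It is read from the transaction record,
# so nobody had to label each example by hand.
sales_records = [
    (1000, 200_000),
    (1500, 300_000),
    (2000, 400_000),
    (2500, 500_000),
]


def fit_line(records):
    """Fit price = slope * size + intercept by ordinary least squares."""
    n = len(records)
    mean_x = sum(x for x, _ in records) / n
    mean_y = sum(y for _, y in records) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in records)
    var = sum((x - mean_x) ** 2 for x, _ in records)
    slope = cov / var
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(model, size):
    """Estimate the price of a house of the given size."""
    slope, intercept = model
    return slope * size + intercept


# Supervised learning: learn the A -> B mapping from (input, output) pairs.
model = fit_line(sales_records)
estimate = predict(model, 1800)
print(round(estimate))
```

The training step only needs (input, output) pairs. Where the outputs came from does not change the algorithm.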

Scooter2000 · October 10, 2022, 1:08pm

Thank you! That does clear it up.

Amit_Shukla · October 15, 2022, 5:01am

Hi @Scooter2000, thanks for posting your question.

I think there is a bit of ambiguity in the question you have asked. The statement is:

The only way to acquire data for a supervised learning algorithm is to manually label it. I.e., given the input A, to ask a human to provide B.

You answered False, i.e. you think that it’s not the only way of acquiring data for supervised learning, while the answer is True.

So I would say that acquiring data for supervised learning does need manual labelling, and it’s actually the only way to do so. Why? Supervised learning algorithms work on labelled data, where the input and output feature values are specified. To collect and develop a dataset for a supervised algorithm, we need the output feature values from the real world, which can only be provided by humans, i.e. there is no other way. Most companies that require such data spend months and a lot of money on collecting and labelling it, because it is done by humans, by hand.

It’s easy to see that machines learn from whatever data you provide. If you give data of the wrong type or with wrong values, your model will not produce any beneficial outcome. Thus the only way left is for people to label the data correctly so that machines learn correctly.

One could also say that we can use synthetic data, i.e. produce new data from data we already have, but you still need some initial data to start with, which is itself a result of human labelling. Thus the statement above is true. The question is asking whether human labelling of data is the only way of acquiring data for a supervised model, and YES, that is true.

I hope I have answered your query,

Thanks And Regards,
@Amit_Shukla
