Newbie question: Andrew says “LLMs are built by using supervised learning to repeatedly predict the next word”. I understand that supervised learning implies use of an UNlabelled dataset. So what evaluation function can be used when the model guesses a possible next word? (I’ve read elsewhere that GPT uses “unsupervised self-supervised” training, just to add to my confusion! The self-supervised method makes sense to me.) Thank you for any clarification…
No, that’s not correct.
Supervised learning always uses labeled data.
Unsupervised learning uses unlabeled data.
Large Language Models are a bit of a special case, because they learn to predict the next word (really the next token) from a big collection of written works. So the training labels are simply the words in the text themselves.
Thank you! Yes, I see I had a brain-freeze when I entered my question, as I knew supervised implied labelled data. But given that, I’m still confused about how the “probable next word” guess is evaluated, given that the input dataset is indeed UNlabelled (unless split into test subsets with masking, i.e. using “self-supervised” training, which is apparently considered “unsupervised” training).
It’s not unlabeled. Since this is a sequence model, we’re predicting the next token in the sequence. So the dataset itself provides the labels: at each position, the “label” is simply whatever token actually comes next in the text.
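To make that concrete, here is a minimal sketch of how (context, label) training pairs fall straight out of raw text. The whitespace tokenizer and the example sentence are just illustrations; real LLMs use subword tokenizers and far longer contexts.

```python
# Illustrative only: real LLMs use subword tokenizers, not str.split().
text = "the cat sat on the mat"
tokens = text.split()

# Each training example pairs a context with the token that actually
# follows it in the text -- the "label" comes from the data itself.
pairs = [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]

for context, label in pairs:
    print(context, "->", label)
# e.g. ['the'] -> cat
#      ['the', 'cat'] -> sat
```

The model’s guess at each position is then scored against that known next token (typically with a cross-entropy loss), which is why the training is “supervised” in mechanics even though no human ever labelled anything.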
The “supervised” and “unsupervised” terminology doesn’t map cleanly onto this situation, because those terms date back to classic batch-trained models making simple predictions from hand-labelled or fully unlabelled data. “Self-supervised” was coined precisely for this middle ground, where the labels are extracted from the data itself.