In the second video of Week 2 (Instruction fine-tuning), Mike describes the data prep tools for LLM fine-tuning: a large corpus of (say) Amazon reviews is formatted as prompts, fed to the LLM, and the outputs are compared with the labels.
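To make that setup concrete, here is a minimal sketch of the prompt-formatting step, assuming each raw review already arrives with a sentiment label attached (the field names, prompt template, and example data are illustrative, not taken from the video):

```python
# Minimal sketch: wrap raw reviews in an instruction-style prompt and pair each
# with its label. The "review"/"label" fields are hypothetical; they stand in
# for whatever labeled dataset the fine-tuning pipeline actually uses.

def build_example(review: str, label: str) -> dict:
    """Turn one labeled review into a prompt/completion training pair."""
    prompt = (
        "Classify the sentiment of the following review as positive or negative.\n\n"
        f"Review: {review}\n\nSentiment: "
    )
    return {"prompt": prompt, "completion": label}

# Hypothetical labeled examples; during fine-tuning the model's output for each
# prompt is compared against the "completion" (the ground-truth label).
raw_data = [
    {"review": "Godfather is great", "label": "positive"},
    {"review": "The plot dragged and the ending fell flat", "label": "negative"},
]

training_pairs = [build_example(r["review"], r["label"]) for r in raw_data]
for pair in training_pairs:
    print(pair["prompt"] + pair["completion"])
```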
Question: where do the labels come from? For example, the Amazon review "Godfather is great" should be classified as positive, but where does this ground truth come from? It is not present in the raw Amazon review…
Whatever the review says, maybe the ground truth is a short summary of the entire review, done by humans?