Supervised Fine-tuning

Charlie_DataScience · April 17, 2024, 6:57pm

Indeed, it makes sense, and that’s certainly an alternative approach. However, including here the desired output in the model input serves a specific purpose.

By providing the desired output along with the instruction and input text, the model learns not only to generate text based on the input but also to explicitly understand what the correct output should be for that particular input and instruction. This essentially guides the models towards producing outputs that align more closely with the desired output during fine-tuning.

If we don’t include the desired output in the model input, the model would solely rely on comparing its generated output with the desired output after processing the input and instruction. While this approach can still work, when we provide the desired output as part of the input this can improve the learning process by giving the model more direct information about what it should aim to produce, and usually leads to faster and more accurate fine-tuning results.

Topic		Replies	Views
Supervised fine-tuning (SFT), instruction fine-tuning and full fine-tuning Generative AI with Large Language Models week-module-2	1	3002	January 10, 2024
Is pre-training the unsupervised training of an LLM? Generative AI with Large Language Models week-module-2	10	284	July 24, 2024
Instruct fine-tuning vs Vanilla fine-tuning Generative AI with Large Language Models week-module-2	5	1534	March 15, 2024
Week 2: Instruction fine-tuning Generative AI with Large Language Models llm , prompting	1	65	November 18, 2024
Week 2, Question 1 Generative AI with Large Language Models week-module-2	1	544	November 17, 2023

Supervised Fine-tuning

Related topics