Week 2: Instruction fine-tuning

Generative AI with Large Language Models

How can we achieve full fine-tuning, i.e., ensure that ALL the weights are updated?

Even if we repeat pre-training on a new, massive dataset, how do we ensure that ALL the weights are updated?
Is full fine-tuning only a theoretical concept unless we provide enough data to modify ALL the weights?

Another question. How does the LLM separate the “main instruction” and the “examples” from the rest of the content in the prompt?
There were a few examples like “Summarize the text…” or “Translate this sentence…”.
We also saw some examples with YAML templates using prompt template libraries, but I could not follow them very well.

What kind of libraries do we use? Is it a simple YAML file with key-value pairs such as {Request: Response}, {Example1: Content}, and so on, or do we need to use specific keywords for specific models?

Hi @agarwalamit081

  1. You can train on a sufficiently large and diverse dataset. Strictly speaking, full fine-tuning means no layers are frozen, so every weight receives gradient updates; guaranteeing that every single weight actually changes by a meaningful amount is hard in practice.
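To make the distinction concrete, here is a toy, pure-Python sketch (all names and numbers are illustrative, not from any real framework): “full” fine-tuning freezes nothing, so every weight gets a gradient update, while “partial” fine-tuning leaves frozen weights untouched.

```python
# Toy illustration: full fine-tuning = no frozen parameters, so every
# weight is updated. Partial fine-tuning freezes a subset, which stays
# unchanged. Layer names and values are made up for the example.

def sgd_step(weights, grads, frozen, lr=0.1):
    """Return updated weight values; skip any weight whose name is frozen."""
    return [
        w if name in frozen else w - lr * g
        for (name, w), g in zip(weights, grads)
    ]

weights = [("layer1", 0.5), ("layer2", -0.3), ("head", 1.2)]
grads = [0.2, -0.1, 0.4]  # pretend gradients from one batch

full = sgd_step(weights, grads, frozen=set())                    # full fine-tuning
partial = sgd_step(weights, grads, frozen={"layer1", "layer2"})  # only the head trains

print(full)     # all three weights moved
print(partial)  # layer1 and layer2 are unchanged
```

The point is that “all weights got updated” is a property of the training setup (nothing frozen), not of how much data you have; data size affects how far and how usefully the weights move.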

  2. The LLM does not parse the prompt explicitly; instruction fine-tuning teaches it to recognize patterns. Pre-defined formats, common phrasing (like “Summarize the text”), and consistent templates during training all help.
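A small sketch of what that looks like in practice: the whole prompt is a single string, and consistent markers such as “Instruction:”, “Input:”, and “Output:” are what the model learned to recognize during instruction fine-tuning. The labels below are illustrative conventions, not required keywords.

```python
# Build a few-shot prompt as one string. The model never receives
# "instruction" and "examples" as separate fields; the repeated
# Input:/Output: pattern is what it picks up on.

def build_prompt(instruction, examples, query):
    parts = [f"Instruction: {instruction}", ""]
    for inp, out in examples:
        parts += [f"Input: {inp}", f"Output: {out}", ""]
    parts += [f"Input: {query}", "Output:"]   # trailing "Output:" invites completion
    return "\n".join(parts)

prompt = build_prompt(
    "Translate this sentence to French.",
    [("Good morning.", "Bonjour."), ("Thank you.", "Merci.")],
    "See you tomorrow.",
)
print(prompt)
```

Ending the prompt with a bare “Output:” is the common trick that cues the model to generate the answer for the final input.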

  3. It depends. Some libraries use simple key-value pairs, but many models expect structured prompts that match the format they were trained on (like OpenAI’s chat message format). Libraries like LangChain simplify this, but each comes with its own conventions.
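A minimal stand-in for what prompt-template libraries do, using only Python’s standard library: a template string with named placeholders plus a dict of key-value pairs. Whether the pairs come from a YAML file or Python code, the core idea is the same substitution; real libraries (e.g. LangChain’s PromptTemplate) add validation and model-specific formatting on top. The field names here are arbitrary examples.

```python
from string import Template

# A template with named placeholders; the dict supplies the key-value
# pairs. This is the essence of what prompt-template libraries automate.
template = Template(
    "Instruction: $request\n"
    "Context: $context\n"
    "Response:"
)

values = {
    "request": "Summarize the text.",
    "context": "LLMs are trained on large corpora.",
}
prompt = template.substitute(values)
print(prompt)
```

So there is no universal keyword set: the keys are whatever your template declares, but the rendered string should match the prompt format the target model expects.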

Hope this helps! Feel free to ask if you need further assistance.