May I just check my understanding here?
Step 2.1 in the lab notebook has us take the example data and generate multiple prompts for the LLM, but we're not giving it any examples of what a good summary looks like:
2.1 - Preprocess the Dialog-Summary Dataset
You need to convert the dialog-summary (prompt-response) pairs into explicit instructions for the LLM. Prepend an instruction to the start of the dialog with
Summarize the following conversation
and to the start of the summary with Summary
as follows:
Training prompt (dialogue):

Summarize the following conversation.

Chris: This is his part of the conversation.
Antje: This is her part of the conversation.

Summary:

Training response (summary):

Both Chris and Antje participated in the conversation.
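To check that I'm reading the step right, here's roughly what I understand the preprocessing to be doing. This is a minimal sketch assuming the Hugging Face datasets/transformers stack; the dataset and model identifiers are my assumptions, not something copied from the notebook:

```python
from datasets import load_dataset
from transformers import AutoTokenizer

# Assumed identifiers -- the notebook may use different ones.
dataset = load_dataset("knkarthick/dialogsum")
tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")

def tokenize_function(example):
    # Wrap each raw dialogue in the instruction template from step 2.1.
    prompts = [
        f"Summarize the following conversation.\n\n{dialogue}\n\nSummary: "
        for dialogue in example["dialogue"]
    ]
    example["input_ids"] = tokenizer(
        prompts, padding="max_length", truncation=True, return_tensors="pt"
    ).input_ids
    # The human-written summary becomes the training target (label);
    # it never appears inside the prompt itself.
    example["labels"] = tokenizer(
        example["summary"], padding="max_length", truncation=True, return_tensors="pt"
    ).input_ids
    return example

tokenized_datasets = dataset.map(tokenize_function, batched=True)
```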
Later on we can see that the results of such training are significant, but what's slightly blowing my mind is that this seems to be basically unsupervised training, yep? In contrast to in-context learning (which is sort of a form of supervision), we're just giving it N prompts and getting N completions, with no examples of what a good completion looks like. And yet it works well. Am I understanding this correctly? If so, wow.
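For contrast, by in-context learning I mean a one-shot prompt like the sketch below, where a worked example sits inside the prompt itself (the names and dialogue here are made up for illustration):

```
Summarize the following conversation.

Tom: Hi, are we still on for lunch?
Ana: Yes, see you at noon.

Summary: Tom and Ana confirmed their lunch plans.

Summarize the following conversation.

Chris: This is his part of the conversation.
Antje: This is her part of the conversation.

Summary:
```

In the fine-tuning setup from step 2.1, no such worked example ever appears in the prompt; the summary only ever shows up as the target completion.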