Lab 2 - What training parameters were used to full train the LoRA tuned model?

ckhordiasma · June 25, 2024, 3:05am

I am trying to replicate the results of the pre-trained LoRA optimized model in lab 2 (in my own jupyter environment), but I am not having any success. So far I have tried

setting the training set to the full training set instead of every 100th sample
setting max_steps to 100, max epochs to 5

I am next going to try setting the learning rate to 1e-5 instead

Does anyone know what the actual training parameters were to make this model?

gent.spah · June 25, 2024, 10:16am

You mean fine tuning the model before PEFT? We as mentors have only the info that you also have in the class, I would suggest to check section (for the parameters given):

2.2 - Fine-Tune the Model with the Preprocessed Dataset

But it also mentions that to fully fine tune the model it will take a few hours so its giving you the final result of the full fine tuning ready to use below!

ckhordiasma · June 25, 2024, 11:16am

no, I mean fine-tuning the model using PEFT. I know it’s going to take longer, but I wanted practice actually knowing what parameters to use to train a useful model, not just download a pre-trained model.

I ended up doing the following with good results:

switch to a computer with GPU and modify code to use gpu (involved putting .to('cuda') at the end of the model definitions and the tokenizer definitions
learning rate of 1e-4
no max steps
5 epochs
full training dataset instead of only 1%
logging_steps=100 to reduce console output

This ended up doing 7790 training steps and took an hour and a half using a GPU. Used a g2-standard-8 vm, nvidia_L4 GPU, and 200GB SSD on google cloud.



---------------------------------------------------------------------------------------------------
BASELINE HUMAN SUMMARY:
#Person1# teaches #Person2# how to upgrade software and hardware in #Person2#'s system.
---------------------------------------------------------------------------------------------------
ORIGINAL MODEL:
#Person2# is considering upgrading #Person1#'s system to make up their own flyers and banners. #Person1# suggests adding a painting program to the software and upgrading the hardware. #Person2# suggests adding a CD-ROM drive.
---------------------------------------------------------------------------------------------------
PEFT MODEL: #Person2# wants to upgrade #Person2#'s system and hardware. #Person1# recommends adding a painting program to #Person2#'s software and adding a CD-ROM drive.

Topic		Replies	Views
Week 2 Lab: Training configuration of the PEFT model Generative AI with Large Language Models ai-discussions	3	48	November 21, 2024
W2 lab lora config and training parameters offline model Generative AI with Large Language Models week-2	2	372	October 31, 2023
PEFT-LoRA: model performance Generative AI with Large Language Models week-2	0	460	October 3, 2023
Week 2 Lab - what parameters to use to fully fine-tune the model? (part 2.2) Generative AI with Large Language Models ai-discussions	4	28	March 11, 2025
Week2: training args for offline models Generative AI with Large Language Models week-2	5	439	August 18, 2023

Lab 2 - What training parameters were used to full train the LoRA tuned model?

2.2 - Fine-Tune the Model with the Preprocessed Dataset

Related topics