PEFT-LoRA: model performance

Using the Lab 2 notebook, I tried experimenting with LoRA fine-tuning (on a GPU) but got much worse ROUGE scores than even the original base model, and I'm wondering why.

I was using the following training arguments, with ~1,250 question-answer pairs for training (10x more than the quick notebook training example):

from transformers import TrainingArguments

peft_training_args = TrainingArguments(
    output_dir=output_dir,        # output_dir is defined earlier in the notebook
    auto_find_batch_size=True,    # let the trainer find a batch size that fits in memory
    learning_rate=1e-3,
    num_train_epochs=1,
    logging_steps=1,
    max_steps=-1,                 # -1: no step cap, train for the full num_train_epochs
)
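
For context, this is roughly how I'm attaching the adapters and launching training (a minimal sketch; the model name, LoraConfig values, and dataset variable are my assumptions, not necessarily the exact Lab settings):

from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModelForSeq2SeqLM, Trainer

# Placeholder base checkpoint -- substitute whatever the Lab notebook loads.
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")

# Assumed adapter settings, not necessarily the exact Lab 2 values.
lora_config = LoraConfig(
    task_type=TaskType.SEQ_2_SEQ_LM,
    r=32,                        # rank of the low-rank update matrices
    lora_alpha=32,               # scaling applied to the adapter update
    lora_dropout=0.05,
    target_modules=["q", "v"],   # attention projections in T5-style models
)

peft_model = get_peft_model(model, lora_config)
peft_model.print_trainable_parameters()  # sanity check: only a small % should be trainable

peft_trainer = Trainer(
    model=peft_model,
    args=peft_training_args,                    # the arguments above
    train_dataset=tokenized_datasets["train"],  # placeholder: my tokenized QA pairs
)
peft_trainer.train()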

My thoughts: the low-rank matrices are initialized with random values, so if I don't train long enough, adding those adapters might inject more randomness than knowledge into my model. Am I on the right track here?
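
One way I thought of checking this is to look at the adapter weights right after get_peft_model, before any training step (a quick sketch, assuming peft's default lora_A/lora_B module naming):

# Inspect the freshly initialized adapter weights; if they are pure noise,
# an undertrained adapter could plausibly hurt the base model's outputs.
for name, param in peft_model.named_parameters():
    if "lora_" in name:
        print(f"{name}: mean={param.data.mean():.6f}, std={param.data.std():.6f}")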

If I'm right, how should I train (how many epochs? the full dataset of ~12k pairs?) to achieve results similar to the PEFT-LoRA checkpoint we're using in the Lab?
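
For reference, this is how I'm scoring the checkpoints, so the gap should come from training rather than from the metric (a sketch using the evaluate library; the prediction and reference lists are placeholders for my eval split):

import evaluate

rouge = evaluate.load("rouge")

# Placeholders: generated answers from a checkpoint vs. reference answers.
predictions = ["generated answer one", "generated answer two"]
references = ["reference answer one", "reference answer two"]

scores = rouge.compute(predictions=predictions, references=references)
print(scores)  # rouge1 / rouge2 / rougeL / rougeLsum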