Hi, I tried to train a fully fine-tuned version of the model (the one named “instruct_model” in the lab). I set num_train_epochs=5 and max_steps=7787, but my model performs much worse than the one downloaded from AWS and is only slightly better than the original model. Does anyone know what hyperparameters the course instructor used to train their model? Do I need to change or add other hyperparameters in training_args? I can't figure out why the model I fine-tuned with the same code performs so poorly.
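For context, this is roughly what my training_args looks like (a minimal sketch; everything besides num_train_epochs and max_steps is a placeholder, not necessarily the notebook's values). One thing I noticed in the Hugging Face Trainer docs: when max_steps is set to a positive number, it overrides num_train_epochs, so I'm not even sure both of my settings are taking effect:

```python
from transformers import TrainingArguments

# Minimal sketch of my setup. Everything except num_train_epochs and
# max_steps is a placeholder, not necessarily the lab notebook's value.
training_args = TrainingArguments(
    output_dir="./full-fine-tune",  # placeholder path
    learning_rate=1e-5,             # placeholder; the instructor's value may differ
    num_train_epochs=5,
    max_steps=7787,  # NOTE: when max_steps > 0, it overrides num_train_epochs,
                     # so training stops at 7787 steps regardless of epochs
    per_device_train_batch_size=8,  # placeholder
    weight_decay=0.01,              # placeholder
    logging_steps=100,              # placeholder
)
```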
In the lab (as far as I remember) they don't fine-tune with the entire dataset, just a small part of it. Also, for the model to be fine-tuned properly (all of its weights), it needs many more epochs than just 5.
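If I remember right, the notebook keeps only a small slice of the data with a filter along these lines (the dataset id and the modulus are from memory, so treat them as illustrative, not the notebook's exact code):

```python
from datasets import load_dataset

# Assuming the dialogue-summarization dataset from the lab; the dataset id
# below is from memory and may not match the notebook exactly.
dataset = load_dataset("knkarthick/dialogsum")

# Keep roughly 1% of the examples (every 100th row) to speed up the lab run.
small_dataset = dataset.filter(
    lambda example, index: index % 100 == 0,
    with_indices=True,
)

print(dataset["train"].num_rows, "->", small_dataset["train"].num_rows)
```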
The model they let us download with the “!aws …” command was fine-tuned on the entire dataset. They said it took several hours to train. Do you know what a reasonable number of epochs would be? I want to reproduce their model's performance.
I don't know, and their several hours might be days for you, because they might have a lot more computing power!
Can we have the parameters for the full PEFT run? I think that would be an important learning experience…
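Seconding this. In the meantime, for anyone who wants to experiment, a typical LoRA setup with the peft library looks something like the sketch below. The r/alpha/dropout values are common defaults I've seen, not the instructor's actual run, and the base model is my assumption of what the lab uses:

```python
from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModelForSeq2SeqLM

# Assuming the FLAN-T5 base model from the lab.
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")

# Illustrative LoRA settings only -- not the instructor's actual values.
lora_config = LoraConfig(
    r=32,                       # rank of the low-rank update matrices
    lora_alpha=32,              # scaling factor applied to the LoRA update
    target_modules=["q", "v"],  # query/value projections in the T5 blocks
    lora_dropout=0.05,
    bias="none",
    task_type=TaskType.SEQ_2_SEQ_LM,
)

peft_model = get_peft_model(model, lora_config)
peft_model.print_trainable_parameters()  # shows how few weights LoRA trains
```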