Parameters for Fine-Tuning locally

Marc1n · July 10, 2023, 3:30pm

I would like to see how Fine-Tuning would work on a local machine rather than using a checkpoint.

For that to work, what parameters should I use for TrainingArguments()?

Here are defaults from the lab for PEFT:

peft_training_args = TrainingArguments(
    output_dir=output_dir,
    auto_find_batch_size=True,
    learning_rate=1e-3,  # Higher learning rate than full fine-tuning.
    num_train_epochs=1,
    logging_steps=1,
    max_steps=1
)

Aiko · July 12, 2023, 10:24am

At the very least, I would remove logging_steps and max_steps here: max_steps=1 limits training to a single step (it overrides num_train_epochs), and logging_steps=1 logs every step. I imagine these were put in for demonstration purposes, and I think the defaults are quite reasonable.
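To make the effect of max_steps concrete, here is a rough sketch of how the total number of optimizer steps works out. It assumes a single device, no gradient accumulation, and that the last partial batch is kept. The helper name planned_steps, the dataset size of 1,000, and the batch size of 8 are made up for illustration; the only behavior taken from TrainingArguments is that a positive max_steps takes precedence over num_train_epochs.

```python
import math


def planned_steps(num_examples, batch_size, num_train_epochs, max_steps=-1):
    """Approximate optimizer steps (hypothetical helper, simplified assumptions)."""
    if max_steps > 0:
        # A positive max_steps wins over num_train_epochs.
        return max_steps
    steps_per_epoch = math.ceil(num_examples / batch_size)
    return math.ceil(num_train_epochs * steps_per_epoch)


# The lab defaults quoted in the question (output_dir left out here).
lab_defaults = {
    "auto_find_batch_size": True,
    "learning_rate": 1e-3,
    "num_train_epochs": 1,
    "logging_steps": 1,
    "max_steps": 1,
}

# Drop the two demo-only settings so the library defaults apply instead.
local_args = {
    key: value
    for key, value in lab_defaults.items()
    if key not in ("logging_steps", "max_steps")
}

# Dataset size 1000 and batch size 8 are illustrative assumptions.
print(planned_steps(1000, 8, lab_defaults["num_train_epochs"], lab_defaults["max_steps"]))  # 1
print(planned_steps(1000, 8, local_args["num_train_epochs"]))  # 125
```

The remaining settings could then be passed on as TrainingArguments(output_dir=output_dir, **local_args).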

For epochs, prowling the internet tells me that there’s a bit of a range in what people actually use. Some mention that 4 is quite often enough, while others do 3, check the performance, and do 3 more until they reach their desired performance, hit diminishing returns, or see performance drop sharply.
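The "do 3, check, do 3 more" routine can be sketched as a small loop. This is only an illustration: train_round and evaluate are hypothetical callables you would supply yourself, the score is assumed to be higher-is-better, and the min_gain threshold and the toy scores are invented to exercise the loop.

```python
def train_in_rounds(train_round, evaluate, epochs_per_round=3, max_rounds=4, min_gain=0.01):
    """Train in rounds; stop once the eval score gains less than min_gain or drops."""
    best = None
    history = []
    for _ in range(max_rounds):
        train_round(epochs_per_round)  # hypothetical: train for a few more epochs
        score = evaluate()             # hypothetical: higher score means better
        history.append(score)
        if best is not None and score - best < min_gain:
            break  # diminishing returns or a performance drop
        best = score
    return history


# Toy stand-ins: made-up scores and a list that records epochs per round.
toy_scores = iter([0.30, 0.38, 0.385, 0.40])
epochs_done = []
history = train_in_rounds(
    lambda n: epochs_done.append(n),
    lambda: next(toy_scores),
)
print(history, sum(epochs_done))
```

With these toy numbers the loop stops after the third round, because the gain from 0.38 to 0.385 is below min_gain.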

I’m not so sure about the learning rate though. Is that your comment? If it’s not, then I’d try leaving it like that.
