Question about optimal parameters and training dataset size for fine-tuning

The lecture "Scaling laws and compute-optimal models" mentions an optimal training dataset size in relation to the number of parameters in the model being trained (roughly 20 tokens per parameter) in the context of pretraining. Do these rules apply to fine-tuning as well?
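For concreteness, here is a minimal sketch of that rule of thumb; the function name and the 7B-parameter example are illustrative assumptions, not from the lecture:

```python
# Rule of thumb from the scaling-laws lecture: compute-optimal pretraining
# uses roughly 20 training tokens per model parameter.

def optimal_pretraining_tokens(num_parameters: float, tokens_per_param: float = 20.0) -> float:
    """Approximate compute-optimal pretraining dataset size, in tokens."""
    return tokens_per_param * num_parameters

# Example: a 7B-parameter model would want on the order of 140B tokens.
print(f"{optimal_pretraining_tokens(7e9):.1e} tokens")  # 1.4e+11
```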

It depends on the kind of fine-tuning. If it's full fine-tuning, which updates all of the model's weights, then probably yes; if it's PEFT or LoRA (you will learn about these later on), probably not, since only a small fraction of the parameters are actually trained.
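To give a rough sense of why PEFT changes the picture, here is a hypothetical sketch of the trainable-parameter count for a single LoRA adapter; the function name, matrix dimensions, and rank are illustrative assumptions, not course material:

```python
# Hypothetical illustration: a rank-r LoRA adapter on a (d x k) weight
# matrix trains r * (d + k) values instead of the full d * k.

def lora_trainable_params(d: int, k: int, r: int) -> int:
    """Trainable parameters for one rank-r LoRA adapter on a d x k matrix."""
    return r * (d + k)

d = k = 4096                              # assumed size of one projection matrix
full = d * k                              # 16,777,216 weights if fully fine-tuned
lora = lora_trainable_params(d, k, r=8)   # 65,536 weights with a rank-8 adapter
print(f"LoRA trains {lora / full:.2%} of this layer's weights")  # ~0.39%
```

With so few trainable weights, the 20-tokens-per-parameter pretraining heuristic has no direct analogue; fine-tuning dataset sizes are typically driven by task coverage and data quality instead.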