Can someone please explain how to use these in fine-tuning?
per_device_train_batch_size=1,  # Batch size for each device (e.g., GPU) during training.
gradient_accumulation_steps=8,  # Number of batches to accumulate before one optimizer step.
Mr. Zu explained it a bit, but I still don't understand it.
My question
- What about people who use cloud providers for GPUs when training? How do we calculate the number of GPUs used in training? Thank you in advance for a response.
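For what it's worth, here is how these two settings usually combine. The number that matters for training dynamics is the effective (global) batch size: the per-device batch size, times the gradient accumulation steps, times the number of GPUs actually training in parallel. The sketch below is my own illustration, not part of the Transformers library; the function name is hypothetical.

```python
# Hypothetical helper showing how the effective (global) batch size is
# typically computed for Hugging Face TrainingArguments-style settings.
def effective_batch_size(per_device_train_batch_size,
                         gradient_accumulation_steps,
                         num_devices):
    # Each optimizer step processes:
    # per-device batch x accumulation steps x number of devices.
    return (per_device_train_batch_size
            * gradient_accumulation_steps
            * num_devices)

# With the settings from the question on a single GPU: 1 * 8 * 1 = 8.
print(effective_batch_size(1, 8, 1))

# Same settings on a cloud instance with 4 GPUs: 1 * 8 * 4 = 32.
print(effective_batch_size(1, 8, 4))
```

On a cloud instance you generally don't guess the GPU count; you can check it at runtime (e.g. `torch.cuda.device_count()` in PyTorch, or `nvidia-smi` on the machine) and plug that in as `num_devices`.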