The lecture gives an overview of how to fine-tune and pretrain LLMs, but how is this achieved in practice? It would be good to know from a developer's perspective. In other words, how do I supply a smaller dataset of thousands of specific samples (in fine-tuning, for example) to, say, GPT-4o?
Typically:
- You obtain the full set of weights and architecture for the model.
- You freeze all of the weights in the model except for the output layer.
- Then you train just the output layer weights using your specific additional examples.
The Deep Learning Specialization has an exercise with an example of this method.
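To make the method above concrete, here is a minimal sketch in PyTorch using a toy feed-forward model (the layer sizes and the training batch are made up for illustration; a real LLM would be loaded from a checkpoint instead):

```python
import torch
import torch.nn as nn

# Toy "pretrained" model: a small feed-forward classifier.
model = nn.Sequential(
    nn.Linear(16, 32),   # pretrained hidden layer (stays frozen)
    nn.ReLU(),
    nn.Linear(32, 4),    # output layer we will retrain
)

# Freeze every parameter in the model...
for param in model.parameters():
    param.requires_grad = False
# ...then unfreeze only the output layer.
for param in model[-1].parameters():
    param.requires_grad = True

# The optimizer only sees the trainable (output-layer) parameters.
optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad), lr=1e-3
)

# One training step on a fake batch of task-specific examples.
x, y = torch.randn(8, 16), torch.randint(0, 4, (8,))
loss = nn.functional.cross_entropy(model(x), y)
loss.backward()
optimizer.step()

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(trainable, total)  # only the output layer's weights and bias train
```

Only the 132 output-layer parameters (out of 676 total) receive gradient updates, which is exactly why this style of fine-tuning needs far less data and compute than full training.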
Thanks. I am not very familiar with ML and am learning generative AI application development. Would adding smaller datasets to the model be exposed through the API associated with the model (like the OpenAI API), or is there some other way to achieve it? And can closed-source models also be fine-tuned this way?
Sorry, I cannot answer questions about how the OpenAI API works.
Fine-tuning means taking a pre-trained large language model and teaching it new things with your own data, i.e. customizing it for your company or task (legal advice, medical help, etc.).
Pretraining is what happens before fine-tuning: training a model from scratch on large-scale general text data (books, articles, web data) so that it learns to understand language.
Steps are as follows:
- You choose a base model (like LLaMA, GPT, etc.).
- For fine-tuning, you give it labeled examples (like question-answer pairs).
- You train it for a few hours or days on GPUs.
- Tools like LoRA, PEFT, or QLoRA help you do this efficiently without needing huge computing power.
- For pretraining, you'd need large-scale data.
Most people skip pretraining and focus on fine-tuning to make existing models smarter for their specific needs.
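To show what LoRA (mentioned in the steps above) actually does, here is a from-scratch sketch of the core idea in PyTorch: instead of updating a frozen weight matrix, you learn a small low-rank correction on top of it. In practice you would use the Hugging Face `peft` library rather than hand-rolling this; the rank and scaling values below are illustrative:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wraps a frozen Linear layer with a trainable low-rank update."""

    def __init__(self, base: nn.Linear, r: int = 4, alpha: float = 8.0):
        super().__init__()
        self.base = base
        # Freeze the pretrained weights.
        for p in self.base.parameters():
            p.requires_grad = False
        # Low-rank adapters: A projects down to rank r, B projects back up.
        # B starts at zero, so the wrapped layer initially behaves exactly
        # like the original pretrained layer.
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x):
        # Frozen base output plus the learned low-rank correction B @ A @ x.
        return self.base(x) + (x @ self.A.T @ self.B.T) * self.scale

layer = LoRALinear(nn.Linear(64, 64))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
total = sum(p.numel() for p in layer.parameters())
print(trainable, total)
```

Here only 512 adapter parameters train against 4,672 total, and the ratio improves dramatically at LLM scale, which is why LoRA-style fine-tuning fits on modest GPUs.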