Module 3: question about error analysis experiments scale

Zen_L · May 21, 2026, 5:47pm

Thank you for an excellent course! It is very much appreciated.

I have a question about the scale of error analysis experiments (screenshot attached).

What does ‘small scale’ mean exactly?

I understand that we create a smaller train dataset, with samples that address the fixes proposed.

But how should the fine-tuning experiment itself be run?
(a) Do a full fine-tuning starting from the same base LLM as before, but this time on a dataset that combines the original fine-tuning dataset with the new, smaller dataset of fixes.
(b) Or do an iterative fine-tuning: start from the previously fine-tuned model and train only on the new, smaller dataset of fixes.

With (b), some catastrophic forgetting can probably happen,
but (a) would take longer and be more expensive.

Thank you in advance!
Zen L

bong.seog.choi · May 22, 2026, 1:14am

It is recommended to include some of the original instruction/alignment data along with the new fixes to avoid catastrophic forgetting. Even better is to use a curated subset of that data; look up “coreset selection” if you are interested. You will also want a regression benchmark for your application to quantify forgetting. Other helpful tactics include using a low learning rate and parameter-efficient fine-tuning (PEFT).
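
As a rough illustration of mixing original data back in with the fixes, here is a minimal sketch. The function name `build_mixed_train_set`, the `replay_fraction` parameter, and the toy records are all hypothetical, and uniform random sampling is only a stand-in for a real coreset selection method.

```python
import random


def build_mixed_train_set(original, fixes, replay_fraction=0.1, seed=0):
    """Combine every fix sample with a random subset of the original data.

    Assumption: uniform random sampling is used here purely as a placeholder;
    a coreset selection method would choose the replay subset more carefully.
    """
    if not 0.0 <= replay_fraction <= 1.0:
        raise ValueError("replay_fraction must be between 0 and 1")
    rng = random.Random(seed)
    n_replay = round(len(original) * replay_fraction)
    replay = rng.sample(original, n_replay)  # sampled without replacement
    mixed = list(fixes) + replay
    rng.shuffle(mixed)  # avoid training on all fixes first, then all replay
    return mixed


# Toy records standing in for real training samples (hypothetical data).
original = [{"id": f"orig-{i}"} for i in range(1000)]
fixes = [{"id": f"fix-{i}"} for i in range(50)]

mixed = build_mixed_train_set(original, fixes, replay_fraction=0.1)
print(len(mixed))  # 150: 50 fixes plus 100 replayed originals
```

The right replay fraction depends on the application, so it is probably worth treating it as something to tune against the regression benchmark rather than a fixed value.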

Zen_L · May 22, 2026, 6:08pm

Thank you! I took a look at ‘coreset selection’ and it looks helpful; I will look into it further.
I do have evals that will let me know if scores degrade.
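
For anyone following along, a check like that could be sketched as below. The function `find_regressions`, the task names, the scores, and the `tolerance` value are all made up for illustration; they are not results from the course or from any real model.

```python
def find_regressions(baseline, candidate, tolerance=0.02):
    """Return the tasks whose score dropped by more than `tolerance`.

    `baseline` and `candidate` map task name -> score (higher is better).
    A task missing from `candidate` raises KeyError, so gaps are not hidden.
    """
    regressions = {}
    for task, base_score in baseline.items():
        drop = base_score - candidate[task]
        if drop > tolerance:
            regressions[task] = drop
    return regressions


# Hypothetical scores before and after fine-tuning on the fixes.
baseline = {"summarization": 0.80, "qa": 0.75, "target_fix": 0.40}
candidate = {"summarization": 0.79, "qa": 0.70, "target_fix": 0.65}

regressions = find_regressions(baseline, candidate)
for task, drop in regressions.items():
    print(f"{task}: dropped by {drop:.2f}")
```

In this made-up example only `qa` is flagged: the targeted fix improved, and the small dip on `summarization` stays within the tolerance.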
