Dataset for fine-tuning an SLM

Hey everyone, I am trying to fine-tune Llama 3.2 1B on some policies I have. The application I am building takes an application form, uses RAG to pull relevant context from the policy, and checks whether the application is eligible. Fine-tuning is also mostly for my own learning.

My confusion is what my dataset should look like:

  1. in terms of data format/structure
  2. in terms of data content

Should the data look exactly as it would at inference time (policy context plus application form in, eligibility decision out)?

Or should it be a simple question-answer format?

Or a question-answer format with the retrieved context included?

Also, I might (not decided yet) want the model to be able to answer further user questions after it gives the eligibility response. In that case the input structure to the model will be different from the earlier formats.
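If you do go that route, one way to avoid maintaining two different input structures is to fold both behaviours into multi-turn training samples: the eligibility verdict as the first assistant turn, a follow-up question after it. A sketch with placeholder content:

```python
import json

# Hypothetical multi-turn sample: eligibility verdict first, then a
# follow-up exchange, so one dataset teaches both behaviours.
followup_sample = {
    "messages": [
        {"role": "user",
         "content": "Policy:\nApplicants must be 18 or older.\n\n"
                    "Application:\nAge: 17\n\nIs this application eligible?"},
        {"role": "assistant",
         "content": "Not eligible: the applicant is under 18."},
        {"role": "user",
         "content": "What would make the application eligible?"},
        {"role": "assistant",
         "content": "The applicant would need to be at least 18 at the time of applying."},
    ]
}

# Serialized the same way as the single-turn samples
line = json.dumps(followup_sample)
```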

Can you guys let me know what the best approach to fine-tuning would be for my case?

Will Llama 3.2 1B or Phi-2 be able to handle all that policy context plus the application form details, and reason over them well enough to return an eligibility decision?