I’m building a solution that requires providing an LLM with some domain-specific knowledge so that it can answer my questions sufficiently accurately. This domain-specific knowledge is static: it’s essentially a set of facts to take into account while responding to my prompt.
At the moment, I add this domain-specific information to my prompt as in-context learning, but it has grown quite significantly, so a single question now takes more than 6k input tokens.
Does it make sense to use fine-tuning to provide my model with this domain-specific knowledge?
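To make the setup concrete, here is a minimal sketch of what I'm doing now. The facts and wording are placeholders, not my real data:

```python
# Sketch of my current in-context approach: the static domain knowledge
# is prepended to every question, so each request pays for it again in
# input tokens. Placeholder facts only; the real narrative is ~6k tokens.
DOMAIN_KNOWLEDGE = (
    "Fact 1: the warranty lasts 12 months.\n"
    "Fact 2: returns require a receipt.\n"
)

def build_prompt(question: str) -> str:
    """Prepend the static knowledge to the user's question."""
    return (
        "Answer using only the facts below.\n\n"
        + DOMAIN_KNOWLEDGE
        + "\nQuestion: " + question
    )

prompt = build_prompt("How long is the warranty?")
```

Every call sends the full `DOMAIN_KNOWLEDGE` block, which is what drives the token count up.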
Thanks for the quick reply. Could you give me a little more explanation behind it?
The way I understand fine-tuning, a developer is supposed to prepare a large number (e.g., 1,000) of prompt-completion pairs that are broadly similar for a given problem. What should that look like when the goal is to give the model domain-specific knowledge? My knowledge is just a long text narrative; there are no prompts and completions within it.
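For reference, my understanding is that the training data for fine-tuning is typically serialized as JSONL (one JSON object per line) of prompt-completion pairs, something like the sketch below. The pairs here are invented placeholders to show the shape; my actual narrative contains nothing like them, which is exactly my question:

```python
import json

# Invented placeholder pairs illustrating the prompt-completion shape
# many fine-tuning APIs expect; no real API is called here.
pairs = [
    {"prompt": "What is the refund window for product X?",
     "completion": "Refunds are accepted within 30 days of purchase."},
    {"prompt": "Who approves refunds above $500?",
     "completion": "A regional manager must approve them."},
]

def to_jsonl(records) -> str:
    """Serialize records as JSONL: one JSON object per line."""
    return "\n".join(json.dumps(r) for r in records)

training_file = to_jsonl(pairs)
```

So my difficulty is how to get from a long narrative to something with this structure.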