If I fine-tune a model or use RAG, I need to evaluate it on prompts that are specific to the fine-tuning data or to the information retrieved through RAG. A public dataset would not have this specificity, so I would need to put together my own evaluation dataset.
Thanks for the response @TMosh! If I’m implementing RAG for a client, would you recommend asking the client for a set of sample questions with their expected answers? Or should I let users try the solution and add a scoring system to track the questions that were not answered correctly?
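The second option mentioned above (letting users try the solution and tracking misses) can be sketched very simply. This is a minimal, hypothetical example: `FeedbackTracker` and its methods are names made up for illustration, and in practice the "correct" flag would come from a thumbs-up/down widget or a reviewer, not be hard-coded.

```python
from collections import Counter


class FeedbackTracker:
    """Log user ratings per question and surface the worst performers."""

    def __init__(self):
        # Each entry is (question, answer, was_correct)
        self.ratings = []

    def record(self, question, answer, correct):
        self.ratings.append((question, answer, correct))

    def failure_rate(self):
        """Fraction of rated answers marked incorrect."""
        if not self.ratings:
            return 0.0
        failures = sum(1 for _, _, ok in self.ratings if not ok)
        return failures / len(self.ratings)

    def failed_questions(self):
        """Questions marked incorrect, most frequent first."""
        counts = Counter(q for q, _, ok in self.ratings if not ok)
        return counts.most_common()


# Example usage with made-up questions:
tracker = FeedbackTracker()
tracker.record("What is our refund window?", "30 days", correct=True)
tracker.record("Who approves invoices?", "The CEO", correct=False)
print(f"failure rate: {tracker.failure_rate():.0%}")
print(tracker.failed_questions())
```

The questions that accumulate failures are exactly the ones worth adding (with correct answers) to a curated evaluation set, so the two approaches complement each other.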
The choice of evaluation dataset depends on what kind of RAG application you are building. This in turn depends on your primary stakeholders' demands: their do's and don'ts, the required features, and the investment available.
A larger dataset is not always best in terms of time and money, though it can give a better outcome. Balancing these criteria helps you decide how small a dataset can be while still covering all the prompt features, so that evaluation doesn't take too much time for your RAG application.
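A small, client-specific eval set like the one discussed above can be run with very little code. A minimal sketch, assuming a hypothetical `rag_answer_fn` (your RAG pipeline's question-to-answer function) and a keyword-match scoring rule, which is a crude but cheap stand-in for LLM-as-judge or human grading:

```python
# Hypothetical eval set: each item has a question and keywords the
# correct answer must contain (supplied by the client).
eval_set = [
    {"question": "What is the return policy?",
     "keywords": ["30 days", "receipt"]},
    {"question": "Who approves invoices?",
     "keywords": ["finance team"]},
]


def keyword_score(answer, keywords):
    """Fraction of expected keywords found in the answer (case-insensitive)."""
    answer = answer.lower()
    hits = sum(1 for kw in keywords if kw.lower() in answer)
    return hits / len(keywords)


def evaluate(rag_answer_fn, eval_set, threshold=0.5):
    """Run every eval question through the pipeline and score the answers."""
    results = []
    for item in eval_set:
        answer = rag_answer_fn(item["question"])
        score = keyword_score(answer, item["keywords"])
        results.append({"question": item["question"],
                        "score": score,
                        "passed": score >= threshold})
    return results


# Example with a stub pipeline (replace with your real RAG call):
stub = lambda q: "Items can be returned within 30 days with a receipt."
for r in evaluate(stub, eval_set):
    print(r)
```

Starting with a dozen or so questions like this is usually enough to catch regressions, and you can grow the set as user feedback reveals new failure cases.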