Instruction tuning for quiz generation

I have been working on a project that uses an LLM with RAG to build a scalable quiz-generation application. In my first attempt, I used the Llama 2-7B model without fine-tuning. However, this led to hallucinations; the main one was that the model did not follow the quiz-generation procedure in the prompt. This time I am using instruction tuning to make the model better. My main question is: what dataset should the model be tuned on so that it gets better at generating questions? A couple of things to note:

  1. I cannot fine-tune on the course data, as I want the application to be scalable so it works for a course in any field. This is why I am using RAG.
  2. I need to generate both numerical and theory-based questions.
  3. I need to know what structure the dataset should have.

I am a beginner at this stage, so any help is appreciated.
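On point 3, one common structure for instruction-tuning data is the Alpaca-style instruction/input/output record, stored as JSONL. The sketch below is an assumption about how such a dataset could look for quiz generation: the `input` field stands in for a retrieved context passage (mimicking what RAG will supply at inference time), and the field names and example contents are illustrative, not from any real dataset.

```python
import json

# Hypothetical Alpaca-style records for quiz generation. Each record pairs
# a context passage (stand-in for RAG-retrieved text) with the desired,
# well-formatted question. Covering both numerical and theory questions in
# the tuning data teaches the format, not the course content, so it stays
# field-agnostic.
records = [
    {
        "instruction": "Generate one numerical question with a worked answer "
                       "from the context below. Use the exact format shown.",
        "input": "Context: A car accelerates uniformly from rest to 20 m/s in 5 s.",
        "output": "Q: What is the car's acceleration?\n"
                  "A: a = (20 - 0) / 5 = 4 m/s^2",
    },
    {
        "instruction": "Generate one theory question with a short answer "
                       "from the context below. Use the exact format shown.",
        "input": "Context: Ohm's law states that V = IR for ohmic conductors.",
        "output": "Q: State Ohm's law.\n"
                  "A: For an ohmic conductor, the voltage equals the current "
                  "times the resistance (V = IR).",
    },
]

# Write one JSON object per line (JSONL), a format most fine-tuning
# toolkits accept.
with open("quiz_tuning_data.jsonl", "w") as f:
    for rec in records:
        f.write(json.dumps(rec) + "\n")
```

The key idea is that every record demonstrates following the procedure on *some* context, so the tuned model learns the behavior rather than memorizing any one course.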

Why not try improving your prompt first? Be very detailed and explicit: list out all the cases you want to handle, with examples, and don't worry if the prompt gets long. Try it and see if you get better behavior from the LLM. A good prompt plus the RAG you have already implemented should give satisfactory results.
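To make the "detailed and explicit" advice concrete, here is a hypothetical prompt template along those lines. The template name, placeholder fields, rules, and embedded example are all illustrative choices, not a known-good recipe; the point is spelling out the question types, the exact output format, and a worked example, with a slot for the RAG-retrieved context.

```python
# Hypothetical prompt template: explicit question counts, explicit rules,
# a formatting example, and a slot for the retrieved context.
QUIZ_PROMPT = """You are a quiz generator. Using ONLY the context below,
write exactly {n_questions} questions: {n_numerical} numerical and
{n_theory} theory-based.

Rules:
- Every question must be answerable from the context alone.
- Numerical questions must include a worked answer.
- Use exactly this format for each question:
  Q<number>: <question text>
  A<number>: <answer text>

Example:
Q1: What is the car's acceleration if it reaches 20 m/s from rest in 5 s?
A1: a = 20 / 5 = 4 m/s^2

Context:
{context}
"""

# At inference time, fill the slots with the retrieved passages.
prompt = QUIZ_PROMPT.format(
    n_questions=3, n_numerical=1, n_theory=2,
    context="(retrieved passages go here)",
)
```

Because the format and rules are stated explicitly (rather than implied), the base model has much less room to drift from the quiz-generation procedure.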

Also, where did you get the data you are passing as context (RAG)?