Hello, I’m trying to build a chatbot that answers questions about an institution. The data comes from the institution’s FAQ and private documents. I generated a synthetic dataset from the private documents, combined it with the FAQ data, and fine-tuned a Gemma model with LoRA. After some testing and inference, it turns out the model hallucinates a lot. I then tried providing context in the prompt and instructing the LLM to answer only based on that context, but the result is either the model looping on a single token forever until it hits the max-token limit, or generating nothing at all.
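For reference, this is roughly how I’m injecting the context into the prompt (a simplified sketch — model loading and generation are omitted, and the helper name `build_prompt` is just for illustration; the turn markers are the standard Gemma chat format):

```python
def build_prompt(context: str, question: str) -> str:
    """Build a context-restricted prompt using Gemma-style chat turns."""
    instruction = (
        "Answer ONLY using the context below. "
        "If the answer is not in the context, say you don't know.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
    # Gemma chat template: user turn, then open the model turn for generation
    return (
        "<start_of_turn>user\n"
        + instruction
        + "<end_of_turn>\n<start_of_turn>model\n"
    )

print(build_prompt("The library opens at 9 AM.", "When does the library open?"))
```

It’s with prompts shaped like this that I get the single-token looping or empty output.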
Do you have any suggestions for solving this problem?