Fine-Tuning an LLM

I fine-tuned LLaMA 3.1 70B for my feedback-giver project using ORPO and SFT. After fine-tuning, when I give the model a prompt, it echoes the entire system message and prompt before producing the response. My system message and input text together are about 1,300 tokens, so the model struggles to generate high-quality content and the response time is very long. How can I adjust the model so that it returns only the output, without repeating the input or system message?
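One likely culprit, if the full `generate` output is being decoded: `model.generate` returns the prompt tokens followed by the newly generated tokens, so decoding everything reproduces the system message and prompt. Below is a minimal sketch of decoding only the new tokens, assuming generation with Hugging Face `transformers`; the model path and message contents are placeholders, not the actual project values.

```python
# Minimal sketch, assuming Hugging Face transformers; the model path and
# message contents are placeholders, not the actual project values.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "path/to/your-finetuned-llama-3.1-70b"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [
    {"role": "system", "content": "..."},  # your ~1,300-token system message
    {"role": "user", "content": "..."},    # your input text
]

# apply_chat_template builds the Llama 3.1 chat prompt and returns token ids.
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=512)

# generate() returns the prompt tokens followed by the new tokens, so slice
# off the prompt length before decoding to keep only the model's response.
response = tokenizer.decode(
    output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True
)
print(response)
```

If the echoed text is genuinely being generated rather than just decoded from the prompt, the more likely cause is that the SFT stage trained on full sequences without masking the prompt tokens, so the model learned to reproduce them; TRL's `DataCollatorForCompletionOnlyLM` masks everything before the assistant turn so the loss is computed only on the response.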

Can you share your code, along with a screenshot showing what output you are getting and what output you are looking for?