Hallucination in summarization

I am trying to fine-tune a pre-trained GPT model for summarization. I manually prepared the training data and made sure that in every example the summary is drawn entirely from the input text. However, after training, the model hallucinates at inference time: it presents ideas that cannot be inferred from the input and sometimes even introduces entities that are not present in the input at all.

Question 1: Why could this be happening, and how can I correct it?

Question 2: Is there a way to automatically evaluate whether the generated summary contains content that cannot be inferred from the input?
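On the evaluation question, one crude but fully automatic baseline is to flag summary words that never occur in the source at all. Real factual-consistency metrics (NLI-based scorers, QA-based checks) are stronger, but this pure-Python sketch catches the blatant cases, such as named entities absent from the input. The function names here are my own, not from any library.

```python
import re


def novel_tokens(source: str, summary: str) -> set[str]:
    """Return summary word types that do not occur anywhere in the source."""
    tokenize = lambda text: set(re.findall(r"[A-Za-z0-9']+", text.lower()))
    return tokenize(summary) - tokenize(source)


def novelty_rate(source: str, summary: str) -> float:
    """Fraction of summary word types unsupported by the source (0.0 = fully grounded)."""
    summary_vocab = set(re.findall(r"[A-Za-z0-9']+", summary.lower()))
    if not summary_vocab:
        return 0.0
    return len(novel_tokens(source, summary)) / len(summary_vocab)


src = "Acme Corp reported record profits in March."
print(novel_tokens(src, "Acme Corp reported profits."))          # set() -> grounded
print(novel_tokens(src, "Acme Corp opened a Berlin office."))    # contains 'berlin'
```

A nonzero novelty rate is only a signal, not proof of hallucination (paraphrases and inflected forms also score as "novel"), so in practice you would inspect the flagged examples or combine this with an entailment model.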

Hallucinations happen because LLMs are, at bottom, statistical models of the words and sentences they were trained on. They simply generate text in what the model estimates to be a probable sequence; they have no concept of ideas, only of sequences of words.

Preventing this behavior is an active area of current LLM research.
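The "probable sequence" point can be made concrete with a toy model. The sketch below (illustrative only, nothing like a real transformer) trains bigram counts and continues a prompt with whatever followed most often in training, regardless of what the current input actually says; that is the mechanism behind a fine-tuned model emitting entities from its training distribution rather than from your document.

```python
from collections import Counter, defaultdict


def train_bigrams(corpus: str) -> dict:
    """Count, for each token, which tokens followed it in the training text."""
    counts = defaultdict(Counter)
    tokens = corpus.split()
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def greedy_continue(model: dict, start: str, steps: int) -> list:
    """Always pick the most frequent follower: pure 'probable sequence' decoding."""
    out = [start]
    for _ in range(steps):
        followers = model.get(out[-1])
        if not followers:
            break
        out.append(followers.most_common(1)[0][0])
    return out


corpus = ("the ceo announced profits . the ceo announced profits . "
          "the ceo announced layoffs .")
model = train_bigrams(corpus)
# 'announced' was followed by 'profits' most often in training, so the model
# emits it even if the document being summarized is actually about layoffs.
print(greedy_continue(model, "ceo", 2))  # ['ceo', 'announced', 'profits']
```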

My use case is quite narrow: all I want is for the output to be restricted to the input text provided. I have worked with out-of-the-box Pegasus models for summarization and saw far fewer hallucinations there. Then there is ChatGPT - I have not seen it hallucinate even once on my summarization use case.

Is it the dataset size, the diversity of examples, or something else? I am not sure at this point. But for a narrow use case like mine, I would expect there to be a known solution that significantly minimizes (if not eliminates) hallucinations.
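One known mitigation for a narrow, extractive-leaning use case like this is post-hoc filtering: drop or flag generated sentences that mention entities absent from the input. The sketch below uses capitalized words as a crude proxy for named entities; a real pipeline would use a proper NER model (e.g. spaCy) and/or an NLI factual-consistency scorer instead.

```python
import re


def capitalized_entities(sentence: str) -> set[str]:
    """Capitalized words other than the first token (a crude entity proxy)."""
    tokens = re.findall(r"[A-Za-z]+", sentence)
    # Skip the first token: it is capitalized regardless of being an entity.
    return {t for t in tokens[1:] if t[0].isupper()}


def filter_unsupported(source: str, summary: str) -> str:
    """Keep only summary sentences whose entity-like words all occur in the source."""
    source_tokens = set(re.findall(r"[A-Za-z]+", source))
    kept = []
    for sent in re.split(r"(?<=[.!?])\s+", summary.strip()):
        if capitalized_entities(sent) <= source_tokens:
            kept.append(sent)
    return " ".join(kept)


src = "Acme reported strong quarterly results, its CEO said."
summ = "Acme reported strong results. Analysts at Zenith praised the move."
print(filter_unsupported(src, summ))  # drops the sentence mentioning 'Zenith'
```

This only removes entity-level hallucinations after the fact; it does nothing about unsupported paraphrased claims, so it complements rather than replaces better training data or constrained decoding.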