Saved in:
Bibliographic Details
Main Authors: Vladika, Juraj, Soydemir, Ihsan, Matthes, Florian
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.19607
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909659038744576
author Vladika, Juraj
Soydemir, Ihsan
Matthes, Florian
author_facet Vladika, Juraj
Soydemir, Ihsan
Matthes, Florian
contents While large language models (LLMs) have shown remarkable capabilities to generate coherent text, they suffer from the issue of hallucinations -- factually inaccurate statements. Among numerous approaches to tackle hallucinations, especially promising are the self-correcting methods. They leverage the multi-turn nature of LLMs to iteratively generate verification questions inquiring additional evidence, answer them with internal or external knowledge, and use that to refine the original response with the new corrections. These methods have been explored for encyclopedic generation, but less so for domains like news summarization. In this work, we investigate two state-of-the-art self-correcting systems by applying them to correct hallucinated summaries using evidence from three search engines. We analyze the results and provide insights into systems' performance, revealing interesting practical findings on the benefits of search engine snippets and few-shot prompts, as well as high alignment of G-Eval and human evaluation.
format Preprint
id arxiv_https___arxiv_org_abs_2506_19607
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Correcting Hallucinations in News Summaries: Exploration of Self-Correcting LLM Methods with External Knowledge
Vladika, Juraj
Soydemir, Ihsan
Matthes, Florian
Computation and Language
While large language models (LLMs) have shown remarkable capabilities to generate coherent text, they suffer from the issue of hallucinations -- factually inaccurate statements. Among numerous approaches to tackle hallucinations, especially promising are the self-correcting methods. They leverage the multi-turn nature of LLMs to iteratively generate verification questions inquiring additional evidence, answer them with internal or external knowledge, and use that to refine the original response with the new corrections. These methods have been explored for encyclopedic generation, but less so for domains like news summarization. In this work, we investigate two state-of-the-art self-correcting systems by applying them to correct hallucinated summaries using evidence from three search engines. We analyze the results and provide insights into systems' performance, revealing interesting practical findings on the benefits of search engine snippets and few-shot prompts, as well as high alignment of G-Eval and human evaluation.
title Correcting Hallucinations in News Summaries: Exploration of Self-Correcting LLM Methods with External Knowledge
topic Computation and Language
url https://arxiv.org/abs/2506.19607