Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Vladika, Juraj, Soydemir, Ihsan, Matthes, Florian
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2506.19607
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909659038744576
author	Vladika, Juraj Soydemir, Ihsan Matthes, Florian
author_facet	Vladika, Juraj Soydemir, Ihsan Matthes, Florian
contents	While large language models (LLMs) have shown remarkable capabilities to generate coherent text, they suffer from the issue of hallucinations -- factually inaccurate statements. Among numerous approaches to tackle hallucinations, especially promising are the self-correcting methods. They leverage the multi-turn nature of LLMs to iteratively generate verification questions inquiring additional evidence, answer them with internal or external knowledge, and use that to refine the original response with the new corrections. These methods have been explored for encyclopedic generation, but less so for domains like news summarization. In this work, we investigate two state-of-the-art self-correcting systems by applying them to correct hallucinated summaries using evidence from three search engines. We analyze the results and provide insights into systems' performance, revealing interesting practical findings on the benefits of search engine snippets and few-shot prompts, as well as high alignment of G-Eval and human evaluation.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_19607
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Correcting Hallucinations in News Summaries: Exploration of Self-Correcting LLM Methods with External Knowledge Vladika, Juraj Soydemir, Ihsan Matthes, Florian Computation and Language While large language models (LLMs) have shown remarkable capabilities to generate coherent text, they suffer from the issue of hallucinations -- factually inaccurate statements. Among numerous approaches to tackle hallucinations, especially promising are the self-correcting methods. They leverage the multi-turn nature of LLMs to iteratively generate verification questions inquiring additional evidence, answer them with internal or external knowledge, and use that to refine the original response with the new corrections. These methods have been explored for encyclopedic generation, but less so for domains like news summarization. In this work, we investigate two state-of-the-art self-correcting systems by applying them to correct hallucinated summaries using evidence from three search engines. We analyze the results and provide insights into systems' performance, revealing interesting practical findings on the benefits of search engine snippets and few-shot prompts, as well as high alignment of G-Eval and human evaluation.
title	Correcting Hallucinations in News Summaries: Exploration of Self-Correcting LLM Methods with External Knowledge
topic	Computation and Language
url	https://arxiv.org/abs/2506.19607

Similar Items