Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Duong, Song, Bronnec, Florian Le, Allauzen, Alexandre, Guigue, Vincent, Lumbreras, Alberto, Soulier, Laure, Gallinari, Patrick
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.13674
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912238331232256
author	Duong, Song Bronnec, Florian Le Allauzen, Alexandre Guigue, Vincent Lumbreras, Alberto Soulier, Laure Gallinari, Patrick
author_facet	Duong, Song Bronnec, Florian Le Allauzen, Alexandre Guigue, Vincent Lumbreras, Alberto Soulier, Laure Gallinari, Patrick
contents	Large Language Models (LLMs), when used for conditional text generation, often produce hallucinations, i.e., information that is unfaithful or not grounded in the input context. This issue arises in typical conditional text generation tasks, such as text summarization and data-to-text generation, where the goal is to produce fluent text based on contextual input. When fine-tuned on specific domains, LLMs struggle to provide faithful answers to a given context, often adding information or generating errors. One underlying cause of this issue is that LLMs rely on statistical patterns learned from their training data. This reliance can interfere with the model's ability to stay faithful to a provided context, leading to the generation of ungrounded information. We build upon this observation and introduce a novel self-supervised method for generating a training set of unfaithful samples. We then refine the model using a training process that encourages the generation of grounded outputs over unfaithful ones, drawing on preference-based training. Our approach leads to significantly more grounded text generation, outperforming existing self-supervised techniques in faithfulness, as evaluated through automatic metrics, LLM-based assessments, and human evaluations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_13674
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation Duong, Song Bronnec, Florian Le Allauzen, Alexandre Guigue, Vincent Lumbreras, Alberto Soulier, Laure Gallinari, Patrick Computation and Language Large Language Models (LLMs), when used for conditional text generation, often produce hallucinations, i.e., information that is unfaithful or not grounded in the input context. This issue arises in typical conditional text generation tasks, such as text summarization and data-to-text generation, where the goal is to produce fluent text based on contextual input. When fine-tuned on specific domains, LLMs struggle to provide faithful answers to a given context, often adding information or generating errors. One underlying cause of this issue is that LLMs rely on statistical patterns learned from their training data. This reliance can interfere with the model's ability to stay faithful to a provided context, leading to the generation of ungrounded information. We build upon this observation and introduce a novel self-supervised method for generating a training set of unfaithful samples. We then refine the model using a training process that encourages the generation of grounded outputs over unfaithful ones, drawing on preference-based training. Our approach leads to significantly more grounded text generation, outperforming existing self-supervised techniques in faithfulness, as evaluated through automatic metrics, LLM-based assessments, and human evaluations.
title	SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation
topic	Computation and Language
url	https://arxiv.org/abs/2502.13674

Similar Items