Bibliographic Details
Main Authors: Duong, Song; Bronnec, Florian Le; Allauzen, Alexandre; Guigue, Vincent; Lumbreras, Alberto; Soulier, Laure; Gallinari, Patrick
Format: Preprint
Published: 2025
Subjects: Computation and Language
Online Access: https://arxiv.org/abs/2502.13674
_version_ 1866912238331232256
author Duong, Song
Bronnec, Florian Le
Allauzen, Alexandre
Guigue, Vincent
Lumbreras, Alberto
Soulier, Laure
Gallinari, Patrick
contents Large Language Models (LLMs), when used for conditional text generation, often produce hallucinations, i.e., information that is unfaithful or not grounded in the input context. This issue arises in typical conditional text generation tasks, such as text summarization and data-to-text generation, where the goal is to produce fluent text based on contextual input. When fine-tuned on specific domains, LLMs struggle to provide faithful answers to a given context, often adding information or generating errors. One underlying cause of this issue is that LLMs rely on statistical patterns learned from their training data. This reliance can interfere with the model's ability to stay faithful to a provided context, leading to the generation of ungrounded information. We build upon this observation and introduce a novel self-supervised method for generating a training set of unfaithful samples. We then refine the model using a training process that encourages the generation of grounded outputs over unfaithful ones, drawing on preference-based training. Our approach leads to significantly more grounded text generation, outperforming existing self-supervised techniques in faithfulness, as evaluated through automatic metrics, LLM-based assessments, and human evaluations.
format Preprint
id arxiv_https___arxiv_org_abs_2502_13674
institution arXiv
publishDate 2025
record_format arxiv
title SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation
topic Computation and Language
url https://arxiv.org/abs/2502.13674