Saved in:
Bibliographic Details
Main Authors: Zhu, Lei, Wang, Xiaobao, Yang, Jianbiao, Wang, Chenyang, He, Dongxiao, Wang, Longbiao, Dang, Jianwu
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.02277
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911737975930880
author Zhu, Lei
Wang, Xiaobao
Yang, Jianbiao
Wang, Chenyang
He, Dongxiao
Wang, Longbiao
Dang, Jianwu
author_facet Zhu, Lei
Wang, Xiaobao
Yang, Jianbiao
Wang, Chenyang
He, Dongxiao
Wang, Longbiao
Dang, Jianwu
contents Factual Error Correction (FEC) aims to revise inaccurate text into statements that are factually consistent with external evidence. Although recent methods perform well on single-hop correction, they often treat claims as atomic units and struggle with multi-hop cases that require compositional reasoning across multiple evidence sources. This challenge is further amplified by limited paired data and difficulties in locating semantic errors within complex reasoning chains. We present CECoR (Compositional Error Correction via Reasoning-aware Synthesis), a reasoning-aware framework that introduces a Decomposition and Injection paradigm for compositional error correction. CECoR decomposes multi-hop claims into interpretable reasoning steps and injects controlled perturbations to synthesize high-quality training pairs. A two-stage learning strategy combining supervised fine-tuning and reinforcement learning improves factual accuracy and robustness. Comprehensive evaluations show that CECoR achieves strong performance on multi-hop benchmarks, outperforming both distantly supervised methods and few-shot LLM baselines. It also generalizes effectively to single-hop correction and remains stable under noisy evidence, demonstrating its versatility for real-world factual correction.
format Preprint
id arxiv_https___arxiv_org_abs_2605_02277
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle CECOR: Correction-oriented synthetic data construction for factual error correction
Zhu, Lei
Wang, Xiaobao
Yang, Jianbiao
Wang, Chenyang
He, Dongxiao
Wang, Longbiao
Dang, Jianwu
Computation and Language
Factual Error Correction (FEC) aims to revise inaccurate text into statements that are factually consistent with external evidence. Although recent methods perform well on single-hop correction, they often treat claims as atomic units and struggle with multi-hop cases that require compositional reasoning across multiple evidence sources. This challenge is further amplified by limited paired data and difficulties in locating semantic errors within complex reasoning chains. We present CECoR (Compositional Error Correction via Reasoning-aware Synthesis), a reasoning-aware framework that introduces a Decomposition and Injection paradigm for compositional error correction. CECoR decomposes multi-hop claims into interpretable reasoning steps and injects controlled perturbations to synthesize high-quality training pairs. A two-stage learning strategy combining supervised fine-tuning and reinforcement learning improves factual accuracy and robustness. Comprehensive evaluations show that CECoR achieves strong performance on multi-hop benchmarks, outperforming both distantly supervised methods and few-shot LLM baselines. It also generalizes effectively to single-hop correction and remains stable under noisy evidence, demonstrating its versatility for real-world factual correction.
title CECOR: Correction-oriented synthetic data construction for factual error correction
topic Computation and Language
url https://arxiv.org/abs/2605.02277