Saved in:
Bibliographic Details
Main Authors: Xu, Fan, Zhang, Huixuan, Zhang, Zhenliang, Wang, Jiahao, Wan, Xiaojun
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.19310
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917032522416128
author Xu, Fan
Zhang, Huixuan
Zhang, Zhenliang
Wang, Jiahao
Wan, Xiaojun
author_facet Xu, Fan
Zhang, Huixuan
Zhang, Zhenliang
Wang, Jiahao
Wan, Xiaojun
contents Current large language models (LLMs) often suffer from hallucination issues, i,e, generating content that appears factual but is actually unreliable. A typical hallucination detection pipeline involves response decomposition (i.e., claim extraction), query generation, evidence collection (i.e., search or retrieval), and claim verification. However, existing methods exhibit limitations in the first two stages, such as context loss during claim extraction and low specificity in query generation, resulting in degraded performance across the hallucination detection pipeline. In this work, we introduce JointCQ https://github.com/pku0xff/JointCQ, a joint claim-and-query generation framework designed to construct an effective and efficient claim-query generator. Our framework leverages elaborately designed evaluation criteria to filter synthesized training data, and finetunes a language model for joint claim extraction and query generation, providing reliable and informative inputs for downstream search and verification. Experimental results demonstrate that our method outperforms previous methods on multiple open-domain QA hallucination detection benchmarks, advancing the goal of more trustworthy and transparent language model systems.
format Preprint
id arxiv_https___arxiv_org_abs_2510_19310
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle JointCQ: Improving Factual Hallucination Detection with Joint Claim and Query Generation
Xu, Fan
Zhang, Huixuan
Zhang, Zhenliang
Wang, Jiahao
Wan, Xiaojun
Computation and Language
Current large language models (LLMs) often suffer from hallucination issues, i,e, generating content that appears factual but is actually unreliable. A typical hallucination detection pipeline involves response decomposition (i.e., claim extraction), query generation, evidence collection (i.e., search or retrieval), and claim verification. However, existing methods exhibit limitations in the first two stages, such as context loss during claim extraction and low specificity in query generation, resulting in degraded performance across the hallucination detection pipeline. In this work, we introduce JointCQ https://github.com/pku0xff/JointCQ, a joint claim-and-query generation framework designed to construct an effective and efficient claim-query generator. Our framework leverages elaborately designed evaluation criteria to filter synthesized training data, and finetunes a language model for joint claim extraction and query generation, providing reliable and informative inputs for downstream search and verification. Experimental results demonstrate that our method outperforms previous methods on multiple open-domain QA hallucination detection benchmarks, advancing the goal of more trustworthy and transparent language model systems.
title JointCQ: Improving Factual Hallucination Detection with Joint Claim and Query Generation
topic Computation and Language
url https://arxiv.org/abs/2510.19310