Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Xu, Fan, Zhang, Huixuan, Zhang, Zhenliang, Wang, Jiahao, Wan, Xiaojun
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2510.19310
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917032522416128
author	Xu, Fan Zhang, Huixuan Zhang, Zhenliang Wang, Jiahao Wan, Xiaojun
author_facet	Xu, Fan Zhang, Huixuan Zhang, Zhenliang Wang, Jiahao Wan, Xiaojun
contents	Current large language models (LLMs) often suffer from hallucination issues, i,e, generating content that appears factual but is actually unreliable. A typical hallucination detection pipeline involves response decomposition (i.e., claim extraction), query generation, evidence collection (i.e., search or retrieval), and claim verification. However, existing methods exhibit limitations in the first two stages, such as context loss during claim extraction and low specificity in query generation, resulting in degraded performance across the hallucination detection pipeline. In this work, we introduce JointCQ https://github.com/pku0xff/JointCQ, a joint claim-and-query generation framework designed to construct an effective and efficient claim-query generator. Our framework leverages elaborately designed evaluation criteria to filter synthesized training data, and finetunes a language model for joint claim extraction and query generation, providing reliable and informative inputs for downstream search and verification. Experimental results demonstrate that our method outperforms previous methods on multiple open-domain QA hallucination detection benchmarks, advancing the goal of more trustworthy and transparent language model systems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_19310
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	JointCQ: Improving Factual Hallucination Detection with Joint Claim and Query Generation Xu, Fan Zhang, Huixuan Zhang, Zhenliang Wang, Jiahao Wan, Xiaojun Computation and Language Current large language models (LLMs) often suffer from hallucination issues, i,e, generating content that appears factual but is actually unreliable. A typical hallucination detection pipeline involves response decomposition (i.e., claim extraction), query generation, evidence collection (i.e., search or retrieval), and claim verification. However, existing methods exhibit limitations in the first two stages, such as context loss during claim extraction and low specificity in query generation, resulting in degraded performance across the hallucination detection pipeline. In this work, we introduce JointCQ https://github.com/pku0xff/JointCQ, a joint claim-and-query generation framework designed to construct an effective and efficient claim-query generator. Our framework leverages elaborately designed evaluation criteria to filter synthesized training data, and finetunes a language model for joint claim extraction and query generation, providing reliable and informative inputs for downstream search and verification. Experimental results demonstrate that our method outperforms previous methods on multiple open-domain QA hallucination detection benchmarks, advancing the goal of more trustworthy and transparent language model systems.
title	JointCQ: Improving Factual Hallucination Detection with Joint Claim and Query Generation
topic	Computation and Language
url	https://arxiv.org/abs/2510.19310

Similar Items