Staff View: Fact-Checking with Large Language Models via Probabilistic Certainty and Consistency :: Library Catalog

Bibliographic Details
Main Authors:	Wang, Haoran; Khalid, Maryam; Wu, Qiong; Gao, Jian; Cao, Cheng
Format:	Preprint
Published:	2026
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.02574

_version_	1866915709670391808
author	Wang, Haoran; Khalid, Maryam; Wu, Qiong; Gao, Jian; Cao, Cheng
author_facet	Wang, Haoran; Khalid, Maryam; Wu, Qiong; Gao, Jian; Cao, Cheng
contents	Large language models (LLMs) are increasingly used in applications requiring factual accuracy, yet their outputs often contain hallucinated responses. While fact-checking can mitigate these errors, existing methods typically retrieve external evidence indiscriminately, overlooking the model's internal knowledge and potentially introducing irrelevant noise. Moreover, current systems lack targeted mechanisms to resolve specific uncertainties in the model's reasoning. Inspired by how humans fact-check, we argue that LLMs should adaptively decide whether to rely on internal knowledge or initiate retrieval based on their confidence in a given claim. We introduce Probabilistic Certainty and Consistency (PCC), a framework that estimates factual confidence by jointly modeling an LLM's probabilistic certainty and reasoning consistency. These confidence signals enable an adaptive verification strategy: the model answers directly when confident, triggers targeted retrieval when uncertain or inconsistent, and escalates to deep search when ambiguity is high. Our confidence-guided routing mechanism ensures that retrieval is invoked only when necessary, improving both efficiency and reliability. Extensive experiments across three challenging benchmarks show that PCC achieves better uncertainty quantification than verbalized confidence and consistently outperforms strong LLM-based fact-checking baselines. Furthermore, we demonstrate that PCC generalizes well across various LLMs.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_02574
institution	arXiv
publishDate	2026
record_format	arxiv
title	Fact-Checking with Large Language Models via Probabilistic Certainty and Consistency
topic	Computation and Language; Artificial Intelligence
url	https://arxiv.org/abs/2601.02574
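
The abstract in the contents field describes a three-way routing decision: answer directly when confident, trigger targeted retrieval when uncertain or inconsistent, and escalate to deep search when ambiguity is high. The record gives no formulas or thresholds, so the following is only a minimal sketch of that idea. The names `Confidence` and `route`, the cutoffs `high=0.8` and `low=0.4`, and the choice to act on the weaker of the two signals are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from enum import Enum


class Route(Enum):
    ANSWER_DIRECTLY = "answer_directly"
    TARGETED_RETRIEVAL = "targeted_retrieval"
    DEEP_SEARCH = "deep_search"


@dataclass(frozen=True)
class Confidence:
    # Both fields are hypothetical stand-ins for the signals named in the abstract.
    certainty: float    # probabilistic certainty of the verdict, in [0, 1]
    consistency: float  # agreement across sampled reasoning paths, in [0, 1]


def route(conf: Confidence, high: float = 0.8, low: float = 0.4) -> Route:
    """Pick a verification strategy from two confidence signals.

    Assumption: the weaker signal drives the decision, and the two
    thresholds are arbitrary illustrative values.
    """
    for value in (conf.certainty, conf.consistency):
        if not 0.0 <= value <= 1.0:
            raise ValueError("confidence signals must lie in [0, 1]")

    weakest = min(conf.certainty, conf.consistency)
    if weakest >= high:
        # Certain and consistent: rely on internal knowledge.
        return Route.ANSWER_DIRECTLY
    if weakest >= low:
        # Uncertain or inconsistent: fetch evidence for the specific doubt.
        return Route.TARGETED_RETRIEVAL
    # Very low on at least one signal: treat as high ambiguity.
    return Route.DEEP_SEARCH


if __name__ == "__main__":
    print(route(Confidence(certainty=0.9, consistency=0.95)).value)
    print(route(Confidence(certainty=0.9, consistency=0.6)).value)
    print(route(Confidence(certainty=0.2, consistency=0.9)).value)
```

Taking the minimum of the two signals is one simple way to make either low certainty or low consistency sufficient to trigger retrieval. The paper's joint model of the two signals may combine them differently.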

Similar Items