Saved in:
| Main Author: | Karge, Jonas |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.22413 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Constructive Interpolation and Concept-Based Beth Definability for Description Logics via Sequents
by: Lyon, Tim S., et al.
Published: (2024)
by: Lyon, Tim S., et al.
Published: (2024)
Agentic Confidence Calibration
by: Zhang, Jiaxin, et al.
Published: (2026)
by: Zhang, Jiaxin, et al.
Published: (2026)
Belief Filtering for Epistemic Control in Linguistic State Space
by: Dumbrava, Sebastian
Published: (2025)
by: Dumbrava, Sebastian
Published: (2025)
Examining Independence in Ensemble Sentiment Analysis: A Study on the Limits of Large Language Models Using the Condorcet Jury Theorem
by: Lefort, Baptiste, et al.
Published: (2024)
by: Lefort, Baptiste, et al.
Published: (2024)
12 Angry AI Agents: Evaluating Multi-Agent LLM Decision-Making Through Cinematic Jury Deliberation
by: Ersoz, Ahmet Bahaddin
Published: (2026)
by: Ersoz, Ahmet Bahaddin
Published: (2026)
The Accountability Horizon: An Impossibility Theorem for Governing Human-Agent Collectives
by: Tibebu, Haileleol
Published: (2026)
by: Tibebu, Haileleol
Published: (2026)
MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools
by: Subramani, Nishant, et al.
Published: (2025)
by: Subramani, Nishant, et al.
Published: (2025)
Architecting Trust in Artificial Epistemic Agents
by: Marchal, Nahema, et al.
Published: (2026)
by: Marchal, Nahema, et al.
Published: (2026)
When Planning Fails Despite Correct Execution: On Epistemic Calibration for LLM-Based Multi-Agent Systems
by: Wang, Zehao, et al.
Published: (2026)
by: Wang, Zehao, et al.
Published: (2026)
Towards Trustworthy Report Generation: A Deep Research Agent with Progressive Confidence Estimation and Calibration
by: Yuan, Yi, et al.
Published: (2026)
by: Yuan, Yi, et al.
Published: (2026)
Confidence Calibration in Large Language Models
by: Michael, Noam, et al.
Published: (2026)
by: Michael, Noam, et al.
Published: (2026)
Confidence Calibration of Classifiers with Many Classes
by: LeCoz, Adrien, et al.
Published: (2024)
by: LeCoz, Adrien, et al.
Published: (2024)
Attractor Geometry of Transformer Memory: From Conflict Arbitration to Confident Hallucination
by: Liang, Qiyao, et al.
Published: (2026)
by: Liang, Qiyao, et al.
Published: (2026)
Calibrated Language Models Must Hallucinate
by: Kalai, Adam Tauman, et al.
Published: (2023)
by: Kalai, Adam Tauman, et al.
Published: (2023)
Asking Is Not Enough: Protocol Sensitivity in LLM Confidence Calibration
by: Kim, Hankyeol, et al.
Published: (2026)
by: Kim, Hankyeol, et al.
Published: (2026)
Calibrated Trust in Dealing with LLM Hallucinations: A Qualitative Study
by: Ryser, Adrian, et al.
Published: (2025)
by: Ryser, Adrian, et al.
Published: (2025)
Refine and Align: Confidence Calibration through Multi-Agent Interaction in VQA
by: Pandey, Ayush, et al.
Published: (2025)
by: Pandey, Ayush, et al.
Published: (2025)
Confidence Calibration under Ambiguous Ground Truth
by: Tao, Linwei, et al.
Published: (2026)
by: Tao, Linwei, et al.
Published: (2026)
Calibrating Verbalized Confidence with Self-Generated Distractors
by: Wang, Victor, et al.
Published: (2025)
by: Wang, Victor, et al.
Published: (2025)
Fact-Level Confidence Calibration and Self-Correction
by: Yuan, Yige, et al.
Published: (2024)
by: Yuan, Yige, et al.
Published: (2024)
The First Token Knows: Single-Decode Confidence for Hallucination Detection
by: Gabriel, Mina
Published: (2026)
by: Gabriel, Mina
Published: (2026)
Deterministic Hallucination Detection in Medical VQA via Confidence-Evidence Bayesian Gain
by: Asadi, Mohammad, et al.
Published: (2026)
by: Asadi, Mohammad, et al.
Published: (2026)
Jury: A Comprehensive Evaluation Toolkit
by: Cavusoglu, Devrim, et al.
Published: (2023)
by: Cavusoglu, Devrim, et al.
Published: (2023)
Double-Calibration: Towards Reliable LLMs via Calibrating Knowledge and Reasoning Confidence
by: Lu, Yuyin, et al.
Published: (2026)
by: Lu, Yuyin, et al.
Published: (2026)
The Confidence Gate Theorem: When Should Ranked Decision Systems Abstain?
by: Doku, Ronald
Published: (2026)
by: Doku, Ronald
Published: (2026)
A Survey of Confidence Estimation and Calibration in Large Language Models
by: Geng, Jiahui, et al.
Published: (2023)
by: Geng, Jiahui, et al.
Published: (2023)
I-CALM: Incentivizing Confidence-Aware Abstention for LLM Hallucination Mitigation
by: Zong, Haotian, et al.
Published: (2026)
by: Zong, Haotian, et al.
Published: (2026)
An Epistemic Perspective on Agent Awareness
by: Naumov, Pavel, et al.
Published: (2025)
by: Naumov, Pavel, et al.
Published: (2025)
Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction
by: Freeman, Brian, et al.
Published: (2026)
by: Freeman, Brian, et al.
Published: (2026)
Don't Think Twice! Over-Reasoning Impairs Confidence Calibration
by: Lacombe, Romain, et al.
Published: (2025)
by: Lacombe, Romain, et al.
Published: (2025)
Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator
by: Luo, Beier, et al.
Published: (2025)
by: Luo, Beier, et al.
Published: (2025)
On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models
by: Seo, Hoigi, et al.
Published: (2025)
by: Seo, Hoigi, et al.
Published: (2025)
Modeling Epistemic Uncertainty in Social Perception via Rashomon Set Agents
by: Yang, Jinming, et al.
Published: (2026)
by: Yang, Jinming, et al.
Published: (2026)
Mitigating LLM Hallucination via Behaviorally Calibrated Reinforcement Learning
by: Wu, Jiayun, et al.
Published: (2025)
by: Wu, Jiayun, et al.
Published: (2025)
PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning
by: Baidya, Madhav S
Published: (2026)
by: Baidya, Madhav S
Published: (2026)
CALICO: Confident Active Learning with Integrated Calibration
by: Querol, Lorenzo S., et al.
Published: (2024)
by: Querol, Lorenzo S., et al.
Published: (2024)
The Silent Scholar Problem: A Probabilistic Framework for Breaking Epistemic Asymmetry in LLM Agents
by: Chong, Zan-Kai, et al.
Published: (2025)
by: Chong, Zan-Kai, et al.
Published: (2025)
A Minimal Agent for Automated Theorem Proving
by: Requena, Borja, et al.
Published: (2026)
by: Requena, Borja, et al.
Published: (2026)
Fine-grained Approaches for Confidence Calibration of LLMs in Automated Code Revision
by: Lin, Hong Yi, et al.
Published: (2026)
by: Lin, Hong Yi, et al.
Published: (2026)
Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals
by: Torrielli, Federico, et al.
Published: (2026)
by: Torrielli, Federico, et al.
Published: (2026)
Similar Items
-
Constructive Interpolation and Concept-Based Beth Definability for Description Logics via Sequents
by: Lyon, Tim S., et al.
Published: (2024) -
Agentic Confidence Calibration
by: Zhang, Jiaxin, et al.
Published: (2026) -
Belief Filtering for Epistemic Control in Linguistic State Space
by: Dumbrava, Sebastian
Published: (2025) -
Examining Independence in Ensemble Sentiment Analysis: A Study on the Limits of Large Language Models Using the Condorcet Jury Theorem
by: Lefort, Baptiste, et al.
Published: (2024) -
12 Angry AI Agents: Evaluating Multi-Agent LLM Decision-Making Through Cinematic Jury Deliberation
by: Ersoz, Ahmet Bahaddin
Published: (2026)