Saved in:
| Main Authors: | Yang, Pengyue, Wen, Jiawen, Jin, Haolin, Huang, Linghan, Chen, Huaming, Chen, Ling |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00977 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
by: Huang, Linghan, et al.
Published: (2025)
by: Huang, Linghan, et al.
Published: (2025)
NSW-EPNews: A News-Augmented Benchmark for Electricity Price Forecasting with LLMs
by: Bi, Zhaoge, et al.
Published: (2025)
by: Bi, Zhaoge, et al.
Published: (2025)
When Should a Language Model Trust Itself? Same-Model Self-Verification as a Conditional Confidence Signal
by: Phalod, Aditya Ajay
Published: (2026)
by: Phalod, Aditya Ajay
Published: (2026)
Large Language Model Confidence Estimation via Black-Box Access
by: Pedapati, Tejaswini, et al.
Published: (2024)
by: Pedapati, Tejaswini, et al.
Published: (2024)
ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models
by: Li, Chen, et al.
Published: (2026)
by: Li, Chen, et al.
Published: (2026)
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future
by: Jin, Haolin, et al.
Published: (2024)
by: Jin, Haolin, et al.
Published: (2024)
Human-aligned AI Model Cards with Weighted Hierarchy Architecture
by: Yang, Pengyue, et al.
Published: (2025)
by: Yang, Pengyue, et al.
Published: (2025)
Confidence in the Reasoning of Large Language Models
by: Pawitan, Yudi, et al.
Published: (2024)
by: Pawitan, Yudi, et al.
Published: (2024)
A Context-Aware Dual-Metric Framework for Confidence Estimation in Large Language Models
by: Yuan, Mingruo, et al.
Published: (2025)
by: Yuan, Mingruo, et al.
Published: (2025)
Feature-Selective Representation Misdirection for Machine Unlearning
by: Chen, Taozhao, et al.
Published: (2025)
by: Chen, Taozhao, et al.
Published: (2025)
What You See Is Not Always What You Get: Evaluating GPT's Comprehension of Source Code
by: Wen, Jiawen, et al.
Published: (2024)
by: Wen, Jiawen, et al.
Published: (2024)
Confidence over Time: Confidence Calibration with Temporal Logic for Large Language Model Reasoning
by: Mao, Zhenjiang, et al.
Published: (2026)
by: Mao, Zhenjiang, et al.
Published: (2026)
Self-Training Large Language Models with Confident Reasoning
by: Jang, Hyosoon, et al.
Published: (2025)
by: Jang, Hyosoon, et al.
Published: (2025)
Binary Autoencoder for Mechanistic Interpretability of Large Language Models
by: Cho, Hakaze, et al.
Published: (2025)
by: Cho, Hakaze, et al.
Published: (2025)
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
by: Chen, Guoxuan, et al.
Published: (2024)
by: Chen, Guoxuan, et al.
Published: (2024)
Deterministic Differentiable Structured Pruning for Large Language Models
by: Huang, Weiyu, et al.
Published: (2026)
by: Huang, Weiyu, et al.
Published: (2026)
Fact or Guesswork? Evaluating Large Language Models' Medical Knowledge with Structured One-Hop Judgments
by: Li, Jiaxi, et al.
Published: (2025)
by: Li, Jiaxi, et al.
Published: (2025)
Confidence-Aware Sub-Structure Beam Search (CABS): Mitigating Hallucination in Structured Data Generation with Large Language Models
by: Wei, Chengwei, et al.
Published: (2024)
by: Wei, Chengwei, et al.
Published: (2024)
Graph-based Confidence Calibration for Large Language Models
by: Li, Yukun, et al.
Published: (2024)
by: Li, Yukun, et al.
Published: (2024)
Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection
by: Lim, Ying Fu, et al.
Published: (2025)
by: Lim, Ying Fu, et al.
Published: (2025)
Confidence Calibration in Large Language Model-Based Entity Matching
by: Kamsteeg, Iris, et al.
Published: (2025)
by: Kamsteeg, Iris, et al.
Published: (2025)
Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization?
by: He, Jianfeng, et al.
Published: (2024)
by: He, Jianfeng, et al.
Published: (2024)
Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models
by: Kumar, Abhishek, et al.
Published: (2024)
by: Kumar, Abhishek, et al.
Published: (2024)
RGD: Multi-LLM Based Agent Debugger via Refinement and Generation Guidance
by: Jin, Haolin, et al.
Published: (2024)
by: Jin, Haolin, et al.
Published: (2024)
CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models
by: Huang, Weiyu, et al.
Published: (2025)
by: Huang, Weiyu, et al.
Published: (2025)
Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models
by: Lin, Zhen, et al.
Published: (2023)
by: Lin, Zhen, et al.
Published: (2023)
Self-ensemble: Mitigating Confidence Mis-calibration for Large Language Models
by: Xu, Zicheng, et al.
Published: (2025)
by: Xu, Zicheng, et al.
Published: (2025)
One Token to Fool LLM-as-a-Judge
by: Zhao, Yulai, et al.
Published: (2025)
by: Zhao, Yulai, et al.
Published: (2025)
Direct Quantized Training of Language Models with Stochastic Rounding
by: Zhao, Kaiyan, et al.
Published: (2024)
by: Zhao, Kaiyan, et al.
Published: (2024)
Confidence-Modulated Speculative Decoding for Large Language Models
by: Sen, Jaydip, et al.
Published: (2025)
by: Sen, Jaydip, et al.
Published: (2025)
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization
by: Liu, Xuefeng, et al.
Published: (2025)
by: Liu, Xuefeng, et al.
Published: (2025)
GOFA: A Generative One-For-All Model for Joint Graph Language Modeling
by: Kong, Lecheng, et al.
Published: (2024)
by: Kong, Lecheng, et al.
Published: (2024)
On the Convergence of Zeroth-Order Federated Tuning for Large Language Models
by: Ling, Zhenqing, et al.
Published: (2024)
by: Ling, Zhenqing, et al.
Published: (2024)
Confidence Geometry Reveals Trace-Level Correctness in Large Language Model Reasoning
by: Liu, Shuo, et al.
Published: (2026)
by: Liu, Shuo, et al.
Published: (2026)
Recurrent Confidence Chain: Temporal-Aware Uncertainty Quantification in Large Language Models
by: Mao, Zhenjiang, et al.
Published: (2026)
by: Mao, Zhenjiang, et al.
Published: (2026)
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
by: Li, Zichong, et al.
Published: (2025)
by: Li, Zichong, et al.
Published: (2025)
Enhancing Trust in Large Language Models via Uncertainty-Calibrated Fine-Tuning
by: Krishnan, Ranganath, et al.
Published: (2024)
by: Krishnan, Ranganath, et al.
Published: (2024)
The Confidence Manifold: Geometric Structure of Correctness Representations in Language Models
by: Cho, Seonglae, et al.
Published: (2026)
by: Cho, Seonglae, et al.
Published: (2026)
DLM-One: Diffusion Language Models for One-Step Sequence Generation
by: Chen, Tianqi, et al.
Published: (2025)
by: Chen, Tianqi, et al.
Published: (2025)
A Bayesian Interpretation of Adaptive Low-Rank Adaptation
by: Chen, Haolin, et al.
Published: (2024)
by: Chen, Haolin, et al.
Published: (2024)
Similar Items
-
The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
by: Huang, Linghan, et al.
Published: (2025) -
NSW-EPNews: A News-Augmented Benchmark for Electricity Price Forecasting with LLMs
by: Bi, Zhaoge, et al.
Published: (2025) -
When Should a Language Model Trust Itself? Same-Model Self-Verification as a Conditional Confidence Signal
by: Phalod, Aditya Ajay
Published: (2026) -
Large Language Model Confidence Estimation via Black-Box Access
by: Pedapati, Tejaswini, et al.
Published: (2024) -
ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models
by: Li, Chen, et al.
Published: (2026)