Guardado en:
| Autores principales: | Zhang, Chuang, Zhu, Zizhen, Wei, Yihao, Tian, Bing, Liu, Junyi, Wang, Henan, Wang, Xavier, Liu, Yaxiao |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2603.03752 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Graph-based Confidence Calibration for Large Language Models
por: Li, Yukun, et al.
Publicado: (2024)
por: Li, Yukun, et al.
Publicado: (2024)
Efficient Long CoT Reasoning in Small Language Models
por: Wang, Zhaoyang, et al.
Publicado: (2025)
por: Wang, Zhaoyang, et al.
Publicado: (2025)
VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning
por: Xiao, Wenyi, et al.
Publicado: (2026)
por: Xiao, Wenyi, et al.
Publicado: (2026)
A Survey of Confidence Estimation and Calibration in Large Language Models
por: Geng, Jiahui, et al.
Publicado: (2023)
por: Geng, Jiahui, et al.
Publicado: (2023)
Distilling Mathematical Reasoning Capabilities into Small Language Models
por: Zhu, Xunyu, et al.
Publicado: (2024)
por: Zhu, Xunyu, et al.
Publicado: (2024)
Double-Calibration: Towards Reliable LLMs via Calibrating Knowledge and Reasoning Confidence
por: Lu, Yuyin, et al.
Publicado: (2026)
por: Lu, Yuyin, et al.
Publicado: (2026)
GraCoRe: Benchmarking Graph Comprehension and Complex Reasoning in Large Language Models
por: Yuan, Zike, et al.
Publicado: (2024)
por: Yuan, Zike, et al.
Publicado: (2024)
LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models
por: Stengel-Eskin, Elias, et al.
Publicado: (2024)
por: Stengel-Eskin, Elias, et al.
Publicado: (2024)
An Enhanced Prompt-Based LLM Reasoning Scheme via Knowledge Graph-Integrated Collaboration
por: Li, Yihao, et al.
Publicado: (2024)
por: Li, Yihao, et al.
Publicado: (2024)
Collaborative Stance Detection via Small-Large Language Model Consistency Verification
por: Yan, Yu, et al.
Publicado: (2025)
por: Yan, Yu, et al.
Publicado: (2025)
LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?
por: Wang, Jingyuan, et al.
Publicado: (2025)
por: Wang, Jingyuan, et al.
Publicado: (2025)
SEAL: Steerable Reasoning Calibration of Large Language Models for Free
por: Chen, Runjin, et al.
Publicado: (2025)
por: Chen, Runjin, et al.
Publicado: (2025)
Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning
por: Cheng, Xiaoxue, et al.
Publicado: (2025)
por: Cheng, Xiaoxue, et al.
Publicado: (2025)
The Dunning-Kruger Effect in Large Language Models: An Empirical Study of Confidence Calibration
por: Ghosh, Sudipta, et al.
Publicado: (2026)
por: Ghosh, Sudipta, et al.
Publicado: (2026)
Mind the Confidence Gap: Overconfidence, Calibration, and Distractor Effects in Large Language Models
por: Chhikara, Prateek
Publicado: (2025)
por: Chhikara, Prateek
Publicado: (2025)
A Survey on Collaborating Small and Large Language Models for Performance, Cost-effectiveness, Cloud-edge Privacy, and Trustworthiness
por: Wang, Fali, et al.
Publicado: (2025)
por: Wang, Fali, et al.
Publicado: (2025)
A Survey on Large Language Models for Mathematical Reasoning
por: Wang, Peng-Yuan, et al.
Publicado: (2025)
por: Wang, Peng-Yuan, et al.
Publicado: (2025)
How Confident Is the First Token? An Uncertainty-Calibrated Prompt Optimization Framework for Large Language Model Classification and Understanding
por: Chen, Wei, et al.
Publicado: (2026)
por: Chen, Wei, et al.
Publicado: (2026)
Beyond Confidence: The Rhythms of Reasoning in Generative Models
por: Liu, Deyuan, et al.
Publicado: (2026)
por: Liu, Deyuan, et al.
Publicado: (2026)
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
por: Wang, Rui, et al.
Publicado: (2025)
por: Wang, Rui, et al.
Publicado: (2025)
Efficient Post-Training Refinement of Latent Reasoning in Large Language Models
por: Wang, Xinyuan, et al.
Publicado: (2025)
por: Wang, Xinyuan, et al.
Publicado: (2025)
MONICA: Real-Time Monitoring and Calibration of Chain-of-Thought Sycophancy in Large Reasoning Models
por: Hu, Jingyu, et al.
Publicado: (2025)
por: Hu, Jingyu, et al.
Publicado: (2025)
InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
por: Xie, Congkai, et al.
Publicado: (2025)
por: Xie, Congkai, et al.
Publicado: (2025)
Structured Chemistry Reasoning with Large Language Models
por: Ouyang, Siru, et al.
Publicado: (2023)
por: Ouyang, Siru, et al.
Publicado: (2023)
CARE-RFT: Confidence-Anchored Reinforcement Finetuning for Reliable Reasoning in Large Language Models
por: Li, Shuozhe, et al.
Publicado: (2026)
por: Li, Shuozhe, et al.
Publicado: (2026)
Mitigating Overthinking in Large Reasoning Language Models via Reasoning Path Deviation Monitoring
por: Guan, Weixin, et al.
Publicado: (2026)
por: Guan, Weixin, et al.
Publicado: (2026)
Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models
por: Li, Junyi, et al.
Publicado: (2025)
por: Li, Junyi, et al.
Publicado: (2025)
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
por: Xiong, Kai, et al.
Publicado: (2023)
por: Xiong, Kai, et al.
Publicado: (2023)
Rewarding Doubt: A Reinforcement Learning Approach to Calibrated Confidence Expression of Large Language Models
por: Bani-Harouni, David, et al.
Publicado: (2025)
por: Bani-Harouni, David, et al.
Publicado: (2025)
TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
por: Chu, Zheng, et al.
Publicado: (2023)
por: Chu, Zheng, et al.
Publicado: (2023)
SmartThinker: Progressive Chain-of-Thought Length Calibration for Efficient Large Language Model Reasoning
por: Hu, Chenzhi, et al.
Publicado: (2026)
por: Hu, Chenzhi, et al.
Publicado: (2026)
When Can Large Reasoning Models Save Thinking? Mechanistic Analysis of Behavioral Divergence in Reasoning
por: Zhu, Rongzhi, et al.
Publicado: (2025)
por: Zhu, Rongzhi, et al.
Publicado: (2025)
Cost-efficient Knowledge-based Question Answering with Large Language Models
por: Dong, Junnan, et al.
Publicado: (2024)
por: Dong, Junnan, et al.
Publicado: (2024)
Don't Think Twice! Over-Reasoning Impairs Confidence Calibration
por: Lacombe, Romain, et al.
Publicado: (2025)
por: Lacombe, Romain, et al.
Publicado: (2025)
Calibrating Verbalized Confidence with Self-Generated Distractors
por: Wang, Victor, et al.
Publicado: (2025)
por: Wang, Victor, et al.
Publicado: (2025)
Improving Large Models with Small models: Lower Costs and Better Performance
por: Chen, Dong, et al.
Publicado: (2024)
por: Chen, Dong, et al.
Publicado: (2024)
DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models
por: Zhu, Yakun, et al.
Publicado: (2025)
por: Zhu, Yakun, et al.
Publicado: (2025)
The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models
por: Liu, Xinyi, et al.
Publicado: (2025)
por: Liu, Xinyi, et al.
Publicado: (2025)
Self-Anchoring Calibration Drift in Large Language Models: How Multi-Turn Conversations Reshape Model Confidence
por: Harshavardhan
Publicado: (2026)
por: Harshavardhan
Publicado: (2026)
Mixture of Reasonings: Teach Large Language Models to Reason with Adaptive Strategies
por: Xiong, Tao, et al.
Publicado: (2025)
por: Xiong, Tao, et al.
Publicado: (2025)
Ejemplares similares
-
Graph-based Confidence Calibration for Large Language Models
por: Li, Yukun, et al.
Publicado: (2024) -
Efficient Long CoT Reasoning in Small Language Models
por: Wang, Zhaoyang, et al.
Publicado: (2025) -
VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning
por: Xiao, Wenyi, et al.
Publicado: (2026) -
A Survey of Confidence Estimation and Calibration in Large Language Models
por: Geng, Jiahui, et al.
Publicado: (2023) -
Distilling Mathematical Reasoning Capabilities into Small Language Models
por: Zhu, Xunyu, et al.
Publicado: (2024)