Saved in:
| Main Authors: | Petullo, James, George, Sonny, Cashman, Dylan, Xue, Nianwen |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.08070 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation
by: Petullo, James, et al.
Published: (2026)
by: Petullo, James, et al.
Published: (2026)
Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction
by: George, Sonny, et al.
Published: (2024)
by: George, Sonny, et al.
Published: (2024)
Confidence Improves Self-Consistency in LLMs
by: Taubenfeld, Amir, et al.
Published: (2025)
by: Taubenfeld, Amir, et al.
Published: (2025)
Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning
by: Oh, Jungsuk, et al.
Published: (2025)
by: Oh, Jungsuk, et al.
Published: (2025)
The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure
by: Li, Yubo, et al.
Published: (2026)
by: Li, Yubo, et al.
Published: (2026)
How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning
by: Chen, Haoyang, et al.
Published: (2026)
by: Chen, Haoyang, et al.
Published: (2026)
MPN: Leveraging Multilingual Patch Neuron for Cross-lingual Model Editing
by: Si, Nianwen, et al.
Published: (2024)
by: Si, Nianwen, et al.
Published: (2024)
Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills
by: Wang, Changsheng, et al.
Published: (2025)
by: Wang, Changsheng, et al.
Published: (2025)
Maximizing Confidence Alone Improves Reasoning
by: Prabhudesai, Mihir, et al.
Published: (2025)
by: Prabhudesai, Mihir, et al.
Published: (2025)
Tiny-QMoE
by: Cashman, Jack, et al.
Published: (2025)
by: Cashman, Jack, et al.
Published: (2025)
Filtered Reasoning Score: Evaluating Reasoning Quality on a Model's Most-Confident Traces
by: Pathak, Manas, et al.
Published: (2026)
by: Pathak, Manas, et al.
Published: (2026)
Reasoning or Fluency? Dissecting Probabilistic Confidence in Best-of-N Selection
by: Kim, Hojin, et al.
Published: (2026)
by: Kim, Hojin, et al.
Published: (2026)
Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
by: Li, Yiwei, et al.
Published: (2025)
by: Li, Yiwei, et al.
Published: (2025)
Self-Training Meets Consistency: Improving LLMs' Reasoning with Consistency-Driven Rationale Evaluation
by: Lee, Jaehyeok, et al.
Published: (2024)
by: Lee, Jaehyeok, et al.
Published: (2024)
Improving Score Reliability of Multiple Choice Benchmarks with Consistency Evaluation and Altered Answer Choices
by: Cavalin, Paulo, et al.
Published: (2025)
by: Cavalin, Paulo, et al.
Published: (2025)
Predicting Winning Captions for Weekly New Yorker Comics
by: Cao, Stanley, et al.
Published: (2024)
by: Cao, Stanley, et al.
Published: (2024)
A Generalised Approach for Encoding and Reasoning with Qualitative Theories in Answer Set Programming
by: Baryannis, George, et al.
Published: (2020)
by: Baryannis, George, et al.
Published: (2020)
Sandwich Reasoning: An Answer-Reasoning-Answer Approach for Low-Latency Query Correction
by: Zhang, Chen, et al.
Published: (2026)
by: Zhang, Chen, et al.
Published: (2026)
Reflective Confidence: Correcting Reasoning Flaws via Online Self-Correction
by: Zeng, Qinglin, et al.
Published: (2025)
by: Zeng, Qinglin, et al.
Published: (2025)
PiCSAR: Probabilistic Confidence Selection And Ranking for Reasoning Chains
by: Leang, Joshua Ong Jun, et al.
Published: (2025)
by: Leang, Joshua Ong Jun, et al.
Published: (2025)
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think
by: Hammoud, Hasan Abed Al Kader, et al.
Published: (2025)
by: Hammoud, Hasan Abed Al Kader, et al.
Published: (2025)
The Shape of Reasoning: Topological Analysis of Reasoning Traces in Large Language Models
by: Tan, Xue Wen, et al.
Published: (2025)
by: Tan, Xue Wen, et al.
Published: (2025)
Does Inference Scaling Improve Reasoning Faithfulness? A Multi-Model Analysis of Self-Consistency Tradeoffs
by: Mehta, Deep
Published: (2026)
by: Mehta, Deep
Published: (2026)
Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces
by: He, Chen, et al.
Published: (2026)
by: He, Chen, et al.
Published: (2026)
Roundtable Policy: Confidence-Weighted-Consensus Aggregation Improves Multi-Agent-System Reasoning
by: Yao, Yu, et al.
Published: (2025)
by: Yao, Yu, et al.
Published: (2025)
FANS -- Formal Answer Selection for Natural Language Math Reasoning Using Lean4
by: Yao, Jiarui, et al.
Published: (2025)
by: Yao, Jiarui, et al.
Published: (2025)
Self-Consistency Boosts Calibration for Math Reasoning
by: Wang, Ante, et al.
Published: (2024)
by: Wang, Ante, et al.
Published: (2024)
Debiased Multimodal Personality Understanding through Dual Causal Intervention
by: Zhu, Yangfu, et al.
Published: (2026)
by: Zhu, Yangfu, et al.
Published: (2026)
ReasonOps: Operator Segmentation for LLM Reasoning Traces
by: Lee, Daniel, et al.
Published: (2026)
by: Lee, Daniel, et al.
Published: (2026)
Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models
by: Zhou, Xin, et al.
Published: (2025)
by: Zhou, Xin, et al.
Published: (2025)
Plantain: Plan-Answer Interleaved Reasoning
by: Liang, Anthony, et al.
Published: (2025)
by: Liang, Anthony, et al.
Published: (2025)
Optimal Bayesian Stopping for Efficient Inference of Consistent LLM Answers
by: Huang, Jingkai, et al.
Published: (2026)
by: Huang, Jingkai, et al.
Published: (2026)
Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards
by: Kim, Seungwook, et al.
Published: (2026)
by: Kim, Seungwook, et al.
Published: (2026)
Reasoning about Study Regulations in Answer Set Programming
by: Hahn, Susana, et al.
Published: (2024)
by: Hahn, Susana, et al.
Published: (2024)
S2Vec: Self-Supervised Geospatial Embeddings for the Built Environment
by: Choudhury, Shushman, et al.
Published: (2025)
by: Choudhury, Shushman, et al.
Published: (2025)
CASK: Core-Aware Selective KV Compression for Reasoning Traces
by: Kim, Buseong, et al.
Published: (2026)
by: Kim, Buseong, et al.
Published: (2026)
Beyond Self-Consistency: Ensemble Reasoning Boosts Consistency and Accuracy of LLMs in Cancer Staging
by: Chang, Chia-Hsuan, et al.
Published: (2024)
by: Chang, Chia-Hsuan, et al.
Published: (2024)
Do Cognitively Interpretable Reasoning Traces Improve LLM Performance?
by: Bhambri, Siddhant, et al.
Published: (2025)
by: Bhambri, Siddhant, et al.
Published: (2025)
Incentivizing LLMs to Self-Verify Their Answers
by: Zhang, Fuxiang, et al.
Published: (2025)
by: Zhang, Fuxiang, et al.
Published: (2025)
Reasoning Aware Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling
by: Wan, Guangya, et al.
Published: (2024)
by: Wan, Guangya, et al.
Published: (2024)
Similar Items
-
CA-SQL: Complexity-Aware Inference Time Reasoning for Text-to-SQL via Exploration and Compute Budget Allocation
by: Petullo, James, et al.
Published: (2026) -
Probing the Capacity of Language Model Agents to Operationalize Disparate Experiential Context Despite Distraction
by: George, Sonny, et al.
Published: (2024) -
Confidence Improves Self-Consistency in LLMs
by: Taubenfeld, Amir, et al.
Published: (2025) -
Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning
by: Oh, Jungsuk, et al.
Published: (2025) -
The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure
by: Li, Yubo, et al.
Published: (2026)