Saved in:
| Main Authors: | Chen, Kang, Wang, Yaoning, Xiong, Kai, Feng, Zhuoka, Sun, Wenhe, Chen, Haotian, Cao, Yixin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.26277 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Thinking Out Loud: Do Reasoning Models Know When They're Right?
by: Zeng, Qingcheng, et al.
Published: (2025)
by: Zeng, Qingcheng, et al.
Published: (2025)
NEX: Neuron Explore-Exploit Scoring for Label-Free Chain-of-Thought Selection and Model Ranking
by: Chen, Kang, et al.
Published: (2026)
by: Chen, Kang, et al.
Published: (2026)
Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification
by: Zhang, Anqi, et al.
Published: (2025)
by: Zhang, Anqi, et al.
Published: (2025)
Do Language Models Know When They're Hallucinating References?
by: Agrawal, Ayush, et al.
Published: (2023)
by: Agrawal, Ayush, et al.
Published: (2023)
ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging
by: Feng, Zhuoka, et al.
Published: (2026)
by: Feng, Zhuoka, et al.
Published: (2026)
Do Androids Know They're Only Dreaming of Electric Sheep?
by: CH-Wang, Sky, et al.
Published: (2023)
by: CH-Wang, Sky, et al.
Published: (2023)
Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric
by: Cao, Yixin, et al.
Published: (2025)
by: Cao, Yixin, et al.
Published: (2025)
Thinking Traps in Long Chain-of-Thought: A Measurable Study and Trap-Aware Adaptive Restart
by: Chen, Kang, et al.
Published: (2026)
by: Chen, Kang, et al.
Published: (2026)
EffiEval: Efficient and Generalizable Model Evaluation via Capability Coverage Maximization
by: Wang, Yaoning, et al.
Published: (2025)
by: Wang, Yaoning, et al.
Published: (2025)
Cogs in a Machine, Doing What They're Meant to Do -- The AMI Submission to the WMT24 General Translation Task
by: Jasonarson, Atli, et al.
Published: (2024)
by: Jasonarson, Atli, et al.
Published: (2024)
Do Small Language Models Know When They're Wrong? Confidence-Based Cascade Scoring for Educational Assessment
by: Burleigh, Tyler
Published: (2026)
by: Burleigh, Tyler
Published: (2026)
Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs
by: Chen, Kang, et al.
Published: (2025)
by: Chen, Kang, et al.
Published: (2025)
Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer
by: Cui, Chenhang, et al.
Published: (2026)
by: Cui, Chenhang, et al.
Published: (2026)
Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs
by: Pan, Haowen, et al.
Published: (2025)
by: Pan, Haowen, et al.
Published: (2025)
Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts
by: Ying, Jiahao, et al.
Published: (2023)
by: Ying, Jiahao, et al.
Published: (2023)
Do Hallucination Neurons Generalize? Evidence from Cross-Domain Transfer in LLMs
by: Vaddi, Snehit, et al.
Published: (2026)
by: Vaddi, Snehit, et al.
Published: (2026)
H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs
by: Gao, Cheng, et al.
Published: (2025)
by: Gao, Cheng, et al.
Published: (2025)
Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers
by: Pan, Haowen, et al.
Published: (2023)
by: Pan, Haowen, et al.
Published: (2023)
Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese
by: Zhang, Jingshen, et al.
Published: (2024)
by: Zhang, Jingshen, et al.
Published: (2024)
Do Self-Evolving Agents Forget? Capability Degradation and Preservation in Lifelong LLM Agent Adaptation
by: Yu, Ye, et al.
Published: (2026)
by: Yu, Ye, et al.
Published: (2026)
The Realignment Problem: When Right becomes Wrong in LLMs
by: Sharma, Aakash Sen, et al.
Published: (2025)
by: Sharma, Aakash Sen, et al.
Published: (2025)
The Knowledge Microscope: Features as Better Analytical Lenses than Neurons
by: Chen, Yuheng, et al.
Published: (2025)
by: Chen, Yuheng, et al.
Published: (2025)
Do LLMs Know What Is Private Internally? Probing and Steering Contextual Privacy Norms in Large Language Model Representations
by: Wang, Haoran, et al.
Published: (2026)
by: Wang, Haoran, et al.
Published: (2026)
Long Context vs. RAG for LLMs: An Evaluation and Revisits
by: Li, Xinze, et al.
Published: (2024)
by: Li, Xinze, et al.
Published: (2024)
PREMISE: Scalable and Strategic Prompt Optimization for Efficient Mathematical Reasoning in Large Models
by: Yu, Ye, et al.
Published: (2025)
by: Yu, Ye, et al.
Published: (2025)
Dissecting Role Cognition in Medical LLMs via Neuronal Ablation
by: Liang, Xun, et al.
Published: (2025)
by: Liang, Xun, et al.
Published: (2025)
When Do Language Models Endorse Limitations on Human Rights Principles?
by: Samway, Keenan, et al.
Published: (2026)
by: Samway, Keenan, et al.
Published: (2026)
Do LLMs Encode Frame Semantics? Evidence from Frame Identification
by: Chundru, Jayanth Krishna, et al.
Published: (2025)
by: Chundru, Jayanth Krishna, et al.
Published: (2025)
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
by: Xiong, Kai, et al.
Published: (2024)
by: Xiong, Kai, et al.
Published: (2024)
MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation
by: Xie, Jiakuan, et al.
Published: (2024)
by: Xie, Jiakuan, et al.
Published: (2024)
"Yeah Right!" -- Do LLMs Exhibit Multimodal Feature Transfer?
by: Reichman, Benjamin, et al.
Published: (2025)
by: Reichman, Benjamin, et al.
Published: (2025)
Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs
by: Zhang, Zhuoxuan, et al.
Published: (2025)
by: Zhang, Zhuoxuan, et al.
Published: (2025)
Empathy and the Right to Be an Exception: What LLMs Can and Cannot Do
by: Kidder, William, et al.
Published: (2024)
by: Kidder, William, et al.
Published: (2024)
One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
by: Cao, Pengfei, et al.
Published: (2024)
by: Cao, Pengfei, et al.
Published: (2024)
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
by: Xiong, Kai, et al.
Published: (2023)
by: Xiong, Kai, et al.
Published: (2023)
SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback
by: Yu, Yaoning, et al.
Published: (2025)
by: Yu, Yaoning, et al.
Published: (2025)
Cracking Factual Knowledge: A Comprehensive Analysis of Degenerate Knowledge Neurons in Large Language Models
by: Chen, Yuheng, et al.
Published: (2024)
by: Chen, Yuheng, et al.
Published: (2024)
White Men Lead, Black Women Help? Benchmarking and Mitigating Language Agency Social Biases in LLMs
by: Wan, Yixin, et al.
Published: (2024)
by: Wan, Yixin, et al.
Published: (2024)
LLM Reasoning Predicts When Models Are Right: Evidence from Coding Classroom Discourse
by: Ahtisham, Bakhtawar, et al.
Published: (2026)
by: Ahtisham, Bakhtawar, et al.
Published: (2026)
XFinBench: Benchmarking LLMs in Complex Financial Problem Solving and Reasoning
by: Zhang, Zhihan, et al.
Published: (2025)
by: Zhang, Zhihan, et al.
Published: (2025)
Similar Items
-
Thinking Out Loud: Do Reasoning Models Know When They're Right?
by: Zeng, Qingcheng, et al.
Published: (2025) -
NEX: Neuron Explore-Exploit Scoring for Label-Free Chain-of-Thought Selection and Model Ranking
by: Chen, Kang, et al.
Published: (2026) -
Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification
by: Zhang, Anqi, et al.
Published: (2025) -
Do Language Models Know When They're Hallucinating References?
by: Agrawal, Ayush, et al.
Published: (2023) -
ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging
by: Feng, Zhuoka, et al.
Published: (2026)