Saved in:
| Main Authors: | Zhao, Jiachen, Sun, Yiyou, Shi, Weiyan, Song, Dawn |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.24941 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games
by: Zhang, Xiaoqing, et al.
Published: (2025)
by: Zhang, Xiaoqing, et al.
Published: (2025)
Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents
by: Li, Xu, et al.
Published: (2026)
by: Li, Xu, et al.
Published: (2026)
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
by: Sun, Yiyou, et al.
Published: (2025)
by: Sun, Yiyou, et al.
Published: (2025)
Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling?
by: Wu, Mingyuan, et al.
Published: (2025)
by: Wu, Mingyuan, et al.
Published: (2025)
How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns
by: Bai, Haoyue, et al.
Published: (2025)
by: Bai, Haoyue, et al.
Published: (2025)
Deep Thinking by Markov Chain of Continuous Thoughts
by: Liu, Jiayu, et al.
Published: (2025)
by: Liu, Jiayu, et al.
Published: (2025)
RL Grokking Recipe: How Does RL Unlock and Transfer New Algorithms in LLMs?
by: Sun, Yiyou, et al.
Published: (2025)
by: Sun, Yiyou, et al.
Published: (2025)
When and How Does In-Distribution Label Help Out-of-Distribution Detection?
by: Du, Xuefeng, et al.
Published: (2024)
by: Du, Xuefeng, et al.
Published: (2024)
Temporal Chain of Thought: Long-Video Understanding by Thinking in Frames
by: Arnab, Anurag, et al.
Published: (2025)
by: Arnab, Anurag, et al.
Published: (2025)
SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning
by: Zhou, Kaiwen, et al.
Published: (2025)
by: Zhou, Kaiwen, et al.
Published: (2025)
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model
by: Zhou, Hengguang, et al.
Published: (2025)
by: Zhou, Hengguang, et al.
Published: (2025)
Let Me Think! A Long Chain-of-Thought Can Be Worth Exponentially Many Short Ones
by: Mirtaheri, Parsa, et al.
Published: (2025)
by: Mirtaheri, Parsa, et al.
Published: (2025)
Is Chain-of-Thought Really Not Explainability? Chain-of-Thought Can Be Faithful without Hint Verbalization
by: Zaman, Kerem, et al.
Published: (2025)
by: Zaman, Kerem, et al.
Published: (2025)
Where's the liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content
by: Bai, Haoyue, et al.
Published: (2025)
by: Bai, Haoyue, et al.
Published: (2025)
Output Supervision Can Obfuscate the Chain of Thought
by: Drori, Jacob, et al.
Published: (2025)
by: Drori, Jacob, et al.
Published: (2025)
Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice
by: Wang, Jiachen T., et al.
Published: (2025)
by: Wang, Jiachen T., et al.
Published: (2025)
Data Shapley in One Training Run
by: Wang, Jiachen T., et al.
Published: (2024)
by: Wang, Jiachen T., et al.
Published: (2024)
ExpThink: Experience-Guided Reinforcement Learning for Adaptive Chain-of-Thought Compression
by: Bian, Tingcheng, et al.
Published: (2026)
by: Bian, Tingcheng, et al.
Published: (2026)
Think When You Need: Self-Adaptive Chain-of-Thought Learning
by: Yang, Junjie, et al.
Published: (2025)
by: Yang, Junjie, et al.
Published: (2025)
DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton
by: Sun, Yiyou, et al.
Published: (2024)
by: Sun, Yiyou, et al.
Published: (2024)
Capturing the Temporal Dependence of Training Data Influence
by: Wang, Jiachen T., et al.
Published: (2024)
by: Wang, Jiachen T., et al.
Published: (2024)
Can Machines Learn the True Probabilities?
by: Kim, Jinsook
Published: (2024)
by: Kim, Jinsook
Published: (2024)
Think Consistently, Reason Efficiently: Energy-Based Calibration for Implicit Chain-of-Thought
by: Chen, Zhikang, et al.
Published: (2025)
by: Chen, Zhikang, et al.
Published: (2025)
Dynamic Chain-of-Thought: Towards Adaptive Deep Reasoning
by: Wang, Libo
Published: (2025)
by: Wang, Libo
Published: (2025)
SALT: Steering Activations towards Leakage-free Thinking in Chain of Thought
by: Batra, Shourya, et al.
Published: (2025)
by: Batra, Shourya, et al.
Published: (2025)
Transformers Provably Learn to Internalize Chain-of-Thought
by: Huang, Yixiao, et al.
Published: (2026)
by: Huang, Yixiao, et al.
Published: (2026)
Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot
by: Cheng, Xiang, et al.
Published: (2025)
by: Cheng, Xiang, et al.
Published: (2025)
Modern Hopfield Networks Require Chain-of-Thought to Solve $\mathsf{NC}^1$-Hard Problems
by: Cao, Yang, et al.
Published: (2024)
by: Cao, Yang, et al.
Published: (2024)
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models
by: Ye, Jiacheng, et al.
Published: (2024)
by: Ye, Jiacheng, et al.
Published: (2024)
Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs
by: Cai, Will, et al.
Published: (2025)
by: Cai, Will, et al.
Published: (2025)
Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
by: Qin, Yiming, et al.
Published: (2025)
by: Qin, Yiming, et al.
Published: (2025)
Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
by: Hu, Shengran, et al.
Published: (2023)
by: Hu, Shengran, et al.
Published: (2023)
Quantifying True Robustness: Synonymity-Weighted Similarity for Trustworthy XAI Evaluation
by: Burger, Christopher
Published: (2025)
by: Burger, Christopher
Published: (2025)
DarkMind: Latent Chain-of-Thought Backdoor in Customized LLMs
by: Guo, Zhen, et al.
Published: (2025)
by: Guo, Zhen, et al.
Published: (2025)
Can DeepFake Speech be Reliably Detected?
by: Liu, Hongbin, et al.
Published: (2024)
by: Liu, Hongbin, et al.
Published: (2024)
Bridging Formal Language with Chain-of-Thought Reasoning to Geometry Problem Solving
by: Yang, Tianyun, et al.
Published: (2025)
by: Yang, Tianyun, et al.
Published: (2025)
AhaRobot: A Low-Cost Open-Source Bimanual Mobile Manipulator for Embodied AI
by: Cui, Haiqin, et al.
Published: (2025)
by: Cui, Haiqin, et al.
Published: (2025)
True Self-Avoiding Walk for Accelerating Markov-Chain Monte Carlo Integration
by: Qinghua, et al.
Published: (2026)
by: Qinghua, et al.
Published: (2026)
Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
by: Tang, Yuntian, et al.
Published: (2026)
by: Tang, Yuntian, et al.
Published: (2026)
On Learning Verifiers and Implications to Chain-of-Thought Reasoning
by: Balcan, Maria-Florina, et al.
Published: (2025)
by: Balcan, Maria-Florina, et al.
Published: (2025)
Similar Items
-
Divide-Fuse-Conquer: Eliciting "Aha Moments" in Multi-Scenario Games
by: Zhang, Xiaoqing, et al.
Published: (2025) -
Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents
by: Li, Xu, et al.
Published: (2026) -
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
by: Sun, Yiyou, et al.
Published: (2025) -
Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling?
by: Wu, Mingyuan, et al.
Published: (2025) -
How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns
by: Bai, Haoyue, et al.
Published: (2025)