Saved in:
| Main Authors: | Cui, Jasmine, Ye, Charles |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.08100 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ASTRO: Teaching Language Models to Reason by Reflecting and Backtracking In-Context
by: Kim, Joongwon, et al.
Published: (2025)
by: Kim, Joongwon, et al.
Published: (2025)
Backtracking When It Strays: Mitigating Dual Exposure Biases in LLM Reasoning Distillation
by: Wang, Bing, et al.
Published: (2026)
by: Wang, Bing, et al.
Published: (2026)
Backtracking for Safety
by: Sel, Bilgehan, et al.
Published: (2025)
by: Sel, Bilgehan, et al.
Published: (2025)
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models
by: Yang, Xiao-Wen, et al.
Published: (2025)
by: Yang, Xiao-Wen, et al.
Published: (2025)
BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism
by: Wu, Qinzhuo, et al.
Published: (2025)
by: Wu, Qinzhuo, et al.
Published: (2025)
Prompt Injection as Role Confusion
by: Ye, Charles, et al.
Published: (2026)
by: Ye, Charles, et al.
Published: (2026)
Correction with Backtracking Reduces Hallucination in Summarization
by: Liu, Zhenzhen, et al.
Published: (2023)
by: Liu, Zhenzhen, et al.
Published: (2023)
Steering When Necessary: Flexible Steering Large Language Models with Backtracking
by: Cheng, Zifeng, et al.
Published: (2025)
by: Cheng, Zifeng, et al.
Published: (2025)
Reinforcement Learning with Backtracking Feedback
by: Sel, Bilgehan, et al.
Published: (2026)
by: Sel, Bilgehan, et al.
Published: (2026)
Backtracking Improves Generation Safety
by: Zhang, Yiming, et al.
Published: (2024)
by: Zhang, Yiming, et al.
Published: (2024)
Layer-Order Inversion: Rethinking Latent Multi-Hop Reasoning in Large Language Models
by: Liu, Xukai, et al.
Published: (2026)
by: Liu, Xukai, et al.
Published: (2026)
Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models
by: Yang, Yukang, et al.
Published: (2025)
by: Yang, Yukang, et al.
Published: (2025)
Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization
by: Ye, Wengao, et al.
Published: (2025)
by: Ye, Wengao, et al.
Published: (2025)
OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework
by: Chen, Ben, et al.
Published: (2026)
by: Chen, Ben, et al.
Published: (2026)
Emergent Representations of Program Semantics in Language Models Trained on Programs
by: Jin, Charles, et al.
Published: (2023)
by: Jin, Charles, et al.
Published: (2023)
Thought Crime: Backdoors and Emergent Misalignment in Reasoning Models
by: Chua, James, et al.
Published: (2025)
by: Chua, James, et al.
Published: (2025)
Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
by: Wang, Haozhe, et al.
Published: (2025)
by: Wang, Haozhe, et al.
Published: (2025)
Unlocking the Working Memory of Large Language Models for Latent Reasoning
by: Aichberger, Lukas, et al.
Published: (2026)
by: Aichberger, Lukas, et al.
Published: (2026)
Latent Reasoning with Supervised Thinking States
by: Amos, Ido, et al.
Published: (2026)
by: Amos, Ido, et al.
Published: (2026)
Latent Chain-of-Thought for Visual Reasoning
by: Sun, Guohao, et al.
Published: (2025)
by: Sun, Guohao, et al.
Published: (2025)
LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning
by: Xu, Zifan, et al.
Published: (2023)
by: Xu, Zifan, et al.
Published: (2023)
SeLaR: Selective Latent Reasoning in Large Language Models
by: Fu, Renyu, et al.
Published: (2026)
by: Fu, Renyu, et al.
Published: (2026)
Efficient Post-Training Refinement of Latent Reasoning in Large Language Models
by: Wang, Xinyuan, et al.
Published: (2025)
by: Wang, Xinyuan, et al.
Published: (2025)
Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data
by: Jin, Charles, et al.
Published: (2024)
by: Jin, Charles, et al.
Published: (2024)
Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model
by: Dong, Yihong, et al.
Published: (2025)
by: Dong, Yihong, et al.
Published: (2025)
SEM: Reinforcement Learning for Search-Efficient Large Language Models
by: Sha, Zeyang, et al.
Published: (2025)
by: Sha, Zeyang, et al.
Published: (2025)
Robust Search with Uncertainty-Aware Value Models for Language Model Reasoning
by: Yu, Fei, et al.
Published: (2025)
by: Yu, Fei, et al.
Published: (2025)
Learning from Contrasts: Synthesizing Reasoning Paths from Diverse Search Trajectories
by: Liu, Peiyang, et al.
Published: (2026)
by: Liu, Peiyang, et al.
Published: (2026)
Latent Reasoning via Sentence Embedding Prediction
by: Hwang, Hyeonbin, et al.
Published: (2025)
by: Hwang, Hyeonbin, et al.
Published: (2025)
Learning to Ponder: Adaptive Reasoning in Latent Space
by: He, Yixin, et al.
Published: (2025)
by: He, Yixin, et al.
Published: (2025)
iCLP: Large Language Model Reasoning with Implicit Cognition Latent Planning
by: Chen, Sijia, et al.
Published: (2025)
by: Chen, Sijia, et al.
Published: (2025)
BFS-PO: Best-First Search for Large Reasoning Models
by: Parascandolo, Fiorenzo, et al.
Published: (2026)
by: Parascandolo, Fiorenzo, et al.
Published: (2026)
Accelerating Large Language Model Reasoning via Speculative Search
by: Wang, Zhihai, et al.
Published: (2025)
by: Wang, Zhihai, et al.
Published: (2025)
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs
by: Shi, Dachuan, et al.
Published: (2025)
by: Shi, Dachuan, et al.
Published: (2025)
Parallel Test-Time Scaling for Latent Reasoning Models
by: You, Runyang, et al.
Published: (2025)
by: You, Runyang, et al.
Published: (2025)
Reasoning Beyond Chain-of-Thought: A Latent Computational Mode in Large Language Models
by: He, Zhenghao, et al.
Published: (2026)
by: He, Zhenghao, et al.
Published: (2026)
LongReasonArena: A Long Reasoning Benchmark for Large Language Models
by: Ding, Jiayu, et al.
Published: (2025)
by: Ding, Jiayu, et al.
Published: (2025)
Lessons from Studying Two-Hop Latent Reasoning
by: Balesni, Mikita, et al.
Published: (2024)
by: Balesni, Mikita, et al.
Published: (2024)
Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision
by: Jang, Chaeyun, et al.
Published: (2025)
by: Jang, Chaeyun, et al.
Published: (2025)
Alignment midtraining for animals
by: Brazilek, Jasmine, et al.
Published: (2026)
by: Brazilek, Jasmine, et al.
Published: (2026)
Similar Items
-
ASTRO: Teaching Language Models to Reason by Reflecting and Backtracking In-Context
by: Kim, Joongwon, et al.
Published: (2025) -
Backtracking When It Strays: Mitigating Dual Exposure Biases in LLM Reasoning Distillation
by: Wang, Bing, et al.
Published: (2026) -
Backtracking for Safety
by: Sel, Bilgehan, et al.
Published: (2025) -
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models
by: Yang, Xiao-Wen, et al.
Published: (2025) -
BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism
by: Wu, Qinzhuo, et al.
Published: (2025)