Saved in:
| Main Authors: | Tang, Mohan, Lu, Sidi |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.17993 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reasoning LLMs are Wandering Solution Explorers
by: Lu, Jiahao, et al.
Published: (2025)
by: Lu, Jiahao, et al.
Published: (2025)
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
by: Liang, Zhenwen, et al.
Published: (2025)
by: Liang, Zhenwen, et al.
Published: (2025)
TurboHopp: Accelerated Molecule Scaffold Hopping with Consistency Models
by: Yoo, Kiwoong, et al.
Published: (2024)
by: Yoo, Kiwoong, et al.
Published: (2024)
TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization
by: Patel, Dipkumar
Published: (2026)
by: Patel, Dipkumar
Published: (2026)
Bayesian Deep Learning Via Expectation Maximization and Turbo Deep Approximate Message Passing
by: Xu, Wei, et al.
Published: (2024)
by: Xu, Wei, et al.
Published: (2024)
Dynamic Context Adaptation and Information Flow Control in Transformers: Introducing the Evaluator Adjuster Unit and Gated Residual Connections
by: Dhayalkar, Sahil Rajesh
Published: (2024)
by: Dhayalkar, Sahil Rajesh
Published: (2024)
TurboAttention: Efficient Attention Approximation For High Throughputs LLMs
by: Kang, Hao, et al.
Published: (2024)
by: Kang, Hao, et al.
Published: (2024)
Higher Gauge Flow Models
by: Strunk, Alexander, et al.
Published: (2025)
by: Strunk, Alexander, et al.
Published: (2025)
Layer Specialization Underlying Compositional Reasoning in Transformers
by: Liu, Jing
Published: (2025)
by: Liu, Jing
Published: (2025)
CHAINSFORMER: Numerical Reasoning on Knowledge Graphs from a Chain Perspective
by: Zhao, Ze, et al.
Published: (2025)
by: Zhao, Ze, et al.
Published: (2025)
Revisiting RaBitQ and TurboQuant: A Symmetric Comparison of Methods, Theory, and Experiments
by: Gao, Jianyang, et al.
Published: (2026)
by: Gao, Jianyang, et al.
Published: (2026)
UnStar: Unlearning with Self-Taught Anti-Sample Reasoning for LLMs
by: Sinha, Yash, et al.
Published: (2024)
by: Sinha, Yash, et al.
Published: (2024)
On the Limits of Layer Pruning for Generative Reasoning in Large Language Models
by: Shrestha, Safal, et al.
Published: (2026)
by: Shrestha, Safal, et al.
Published: (2026)
Learning Tennis Strategy Through Curriculum-Based Dueling Double Deep Q-Networks
by: Mohan, Vishnu
Published: (2025)
by: Mohan, Vishnu
Published: (2025)
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
by: Liu, Xiaogeng, et al.
Published: (2024)
by: Liu, Xiaogeng, et al.
Published: (2024)
Unveiling Mode Connectivity in Graph Neural Networks
by: Li, Bingheng, et al.
Published: (2025)
by: Li, Bingheng, et al.
Published: (2025)
Revealing Combinatorial Reasoning of GNNs via Graph Concept Bottleneck Layer
by: Niu, Yue, et al.
Published: (2026)
by: Niu, Yue, et al.
Published: (2026)
Self-Evolving Curriculum for LLM Reasoning
by: Chen, Xiaoyin, et al.
Published: (2025)
by: Chen, Xiaoyin, et al.
Published: (2025)
Beyond the Lower Bound: Bridging Regret Minimization and Best Arm Identification in Lexicographic Bandits
by: Xue, Bo, et al.
Published: (2025)
by: Xue, Bo, et al.
Published: (2025)
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
by: Zhang, Jintao, et al.
Published: (2025)
by: Zhang, Jintao, et al.
Published: (2025)
TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate
by: Zandieh, Amir, et al.
Published: (2025)
by: Zandieh, Amir, et al.
Published: (2025)
Efficient Regression-Based Training of Normalizing Flows for Boltzmann Generators
by: Rehman, Danyal, et al.
Published: (2025)
by: Rehman, Danyal, et al.
Published: (2025)
Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
by: Yin, Yutong, et al.
Published: (2025)
by: Yin, Yutong, et al.
Published: (2025)
On Variance Reduction in Learning Mean Flows
by: Lu, Juanwu, et al.
Published: (2026)
by: Lu, Juanwu, et al.
Published: (2026)
Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis
by: Fartale, Harshwardhan, et al.
Published: (2025)
by: Fartale, Harshwardhan, et al.
Published: (2025)
Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training
by: Nepal, Aadim, et al.
Published: (2025)
by: Nepal, Aadim, et al.
Published: (2025)
Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores
by: Badash, Zvi N., et al.
Published: (2026)
by: Badash, Zvi N., et al.
Published: (2026)
Efficient Knowledge Tracing Leveraging Higher-Order Information in Integrated Graphs
by: Han, Donghee, et al.
Published: (2025)
by: Han, Donghee, et al.
Published: (2025)
Implicit Hypergraph Neural Networks: A Stable Framework for Higher-Order Relational Learning with Provable Guarantees
by: Li, Xiaoyu, et al.
Published: (2025)
by: Li, Xiaoyu, et al.
Published: (2025)
CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning
by: Meng, Fanxu, et al.
Published: (2024)
by: Meng, Fanxu, et al.
Published: (2024)
Hypergraph Pattern Machine: Compositional Tokenization for Higher-Order Interactions
by: Zhao, Kyrie, et al.
Published: (2026)
by: Zhao, Kyrie, et al.
Published: (2026)
Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model
by: Wu, Jiahao, et al.
Published: (2026)
by: Wu, Jiahao, et al.
Published: (2026)
Higher-order Structure Boosts Link Prediction on Temporal Graphs
by: Liu, Jingzhe, et al.
Published: (2025)
by: Liu, Jingzhe, et al.
Published: (2025)
When Fewer Layers Break More Chains: Layer Pruning Harms Test-Time Scaling in LLMs
by: Wang, Keyu, et al.
Published: (2025)
by: Wang, Keyu, et al.
Published: (2025)
Native Reasoning Models: Training Language Models to Reason on Unverifiable Data
by: Wang, Yuanfu, et al.
Published: (2026)
by: Wang, Yuanfu, et al.
Published: (2026)
Implicit Regularization of Gradient Flow on One-Layer Softmax Attention
by: Sheen, Heejune, et al.
Published: (2024)
by: Sheen, Heejune, et al.
Published: (2024)
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
by: Zhang, Mohan, et al.
Published: (2025)
by: Zhang, Mohan, et al.
Published: (2025)
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows
by: Cho, Minjae, et al.
Published: (2024)
by: Cho, Minjae, et al.
Published: (2024)
Information-Theoretic Greedy Layer-wise Training for Traffic Sign Recognition
by: Lyu, Shuyan, et al.
Published: (2025)
by: Lyu, Shuyan, et al.
Published: (2025)
The Other Side of the Coin: Unveiling the Downsides of Model Aggregation in Federated Learning from a Layer-peeled Perspective
by: Zhu, Guogang, et al.
Published: (2025)
by: Zhu, Guogang, et al.
Published: (2025)
Similar Items
-
Reasoning LLMs are Wandering Solution Explorers
by: Lu, Jiahao, et al.
Published: (2025) -
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
by: Liang, Zhenwen, et al.
Published: (2025) -
TurboHopp: Accelerated Molecule Scaffold Hopping with Consistency Models
by: Yoo, Kiwoong, et al.
Published: (2024) -
TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization
by: Patel, Dipkumar
Published: (2026) -
Bayesian Deep Learning Via Expectation Maximization and Turbo Deep Approximate Message Passing
by: Xu, Wei, et al.
Published: (2024)