:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tang, Mohan, Lu, Sidi
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.17993
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Reasoning LLMs are Wandering Solution Explorers
by: Lu, Jiahao, et al.
Published: (2025)

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
by: Liang, Zhenwen, et al.
Published: (2025)

TurboHopp: Accelerated Molecule Scaffold Hopping with Consistency Models
by: Yoo, Kiwoong, et al.
Published: (2024)

TurboAngle: Near-Lossless KV Cache Compression via Uniform Angle Quantization
by: Patel, Dipkumar
Published: (2026)

Bayesian Deep Learning Via Expectation Maximization and Turbo Deep Approximate Message Passing
by: Xu, Wei, et al.
Published: (2024)

Dynamic Context Adaptation and Information Flow Control in Transformers: Introducing the Evaluator Adjuster Unit and Gated Residual Connections
by: Dhayalkar, Sahil Rajesh
Published: (2024)

TurboAttention: Efficient Attention Approximation For High Throughputs LLMs
by: Kang, Hao, et al.
Published: (2024)

Higher Gauge Flow Models
by: Strunk, Alexander, et al.
Published: (2025)

Layer Specialization Underlying Compositional Reasoning in Transformers
by: Liu, Jing
Published: (2025)

CHAINSFORMER: Numerical Reasoning on Knowledge Graphs from a Chain Perspective
by: Zhao, Ze, et al.
Published: (2025)

Revisiting RaBitQ and TurboQuant: A Symmetric Comparison of Methods, Theory, and Experiments
by: Gao, Jianyang, et al.
Published: (2026)

UnStar: Unlearning with Self-Taught Anti-Sample Reasoning for LLMs
by: Sinha, Yash, et al.
Published: (2024)

On the Limits of Layer Pruning for Generative Reasoning in Large Language Models
by: Shrestha, Safal, et al.
Published: (2026)

Learning Tennis Strategy Through Curriculum-Based Dueling Double Deep Q-Networks
by: Mohan, Vishnu
Published: (2025)

AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
by: Liu, Xiaogeng, et al.
Published: (2024)

Unveiling Mode Connectivity in Graph Neural Networks
by: Li, Bingheng, et al.
Published: (2025)

Revealing Combinatorial Reasoning of GNNs via Graph Concept Bottleneck Layer
by: Niu, Yue, et al.
Published: (2026)

Self-Evolving Curriculum for LLM Reasoning
by: Chen, Xiaoyin, et al.
Published: (2025)

Beyond the Lower Bound: Bridging Regret Minimization and Best Arm Identification in Lexicographic Bandits
by: Xue, Bo, et al.
Published: (2025)

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
by: Zhang, Jintao, et al.
Published: (2025)

TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate
by: Zandieh, Amir, et al.
Published: (2025)

Efficient Regression-Based Training of Normalizing Flows for Boltzmann Generators
by: Rehman, Danyal, et al.
Published: (2025)

Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?
by: Yin, Yutong, et al.
Published: (2025)

On Variance Reduction in Learning Mean Flows
by: Lu, Juanwu, et al.
Published: (2026)

Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis
by: Fartale, Harshwardhan, et al.
Published: (2025)

Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training
by: Nepal, Aadim, et al.
Published: (2025)

Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores
by: Badash, Zvi N., et al.
Published: (2026)

Efficient Knowledge Tracing Leveraging Higher-Order Information in Integrated Graphs
by: Han, Donghee, et al.
Published: (2025)

Implicit Hypergraph Neural Networks: A Stable Framework for Higher-Order Relational Learning with Provable Guarantees
by: Li, Xiaoyu, et al.
Published: (2025)

CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning
by: Meng, Fanxu, et al.
Published: (2024)

Hypergraph Pattern Machine: Compositional Tokenization for Higher-Order Interactions
by: Zhao, Kyrie, et al.
Published: (2026)

Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model
by: Wu, Jiahao, et al.
Published: (2026)

Higher-order Structure Boosts Link Prediction on Temporal Graphs
by: Liu, Jingzhe, et al.
Published: (2025)

When Fewer Layers Break More Chains: Layer Pruning Harms Test-Time Scaling in LLMs
by: Wang, Keyu, et al.
Published: (2025)

Native Reasoning Models: Training Language Models to Reason on Unverifiable Data
by: Wang, Yuanfu, et al.
Published: (2026)

Implicit Regularization of Gradient Flow on One-Layer Softmax Attention
by: Sheen, Heejune, et al.
Published: (2024)

One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
by: Zhang, Mohan, et al.
Published: (2025)

Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows
by: Cho, Minjae, et al.
Published: (2024)

Information-Theoretic Greedy Layer-wise Training for Traffic Sign Recognition
by: Lyu, Shuyan, et al.
Published: (2025)

The Other Side of the Coin: Unveiling the Downsides of Model Aggregation in Federated Learning from a Layer-peeled Perspective
by: Zhu, Guogang, et al.
Published: (2025)