:: Library Catalog

Bibliographic Details
Main Authors:	Wang, Fuxin; Alazali, Amr; Zhong, Yiqiao
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Artificial Intelligence; I.2.6; I.2.7
Online Access:	https://arxiv.org/abs/2602.01017

Similar Items

Shattered Compositionality: Counterintuitive Learning Dynamics of Transformers for Arithmetic
by: Zhao, Xingyu, et al.
Published: (2026)

The Geometry of Thought: How Scale Restructures Reasoning In Large Language Models
by: Anderson, Samuel Cyrenius
Published: (2026)

Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)

Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs
by: Eisenstadt, Roy, et al.
Published: (2025)

CircuitProbe: Predicting Reasoning Circuits in Transformers via Stability Zone Detection
by: Panuganti, Rajkiran
Published: (2026)

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning
by: Deshmukh, Pratik, et al.
Published: (2026)

Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features
by: Heyman, Alex, et al.
Published: (2025)

SCULPT: Constraint-Guided Pruned MCTS that Carves Efficient Paths for Mathematical Reasoning
by: Fang, Qitong, et al.
Published: (2026)

Scaling Trends for Multi-Hop Contextual Reasoning in Mid-Scale Language Models
by: Steele, Brady, et al.
Published: (2026)

Social Cooperation in Conversational AI Agents
by: Çelikok, Mustafa Mert, et al.
Published: (2025)

FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
by: Yuan, Xin, et al.
Published: (2025)

Extreme AutoML: Analysis of Classification, Regression, and NLP Performance
by: Ratner, Edward, et al.
Published: (2024)

PRPO: Aligning Process Reward with Outcome Reward in Policy Optimization
by: Ding, Ruiyi, et al.
Published: (2026)

Quantization Undoes Alignment: Bias Emergence in Compressed LLMs Across Models and Precision Levels
by: Rath, Plawan Kumar, et al.
Published: (2026)

OFMU: Optimization-Driven Framework for Machine Unlearning
by: Asif, Sadia, et al.
Published: (2025)

BLOCK-EM: Preventing Emergent Misalignment via Latent Blocking
by: Ustaomeroglu, Muhammed, et al.
Published: (2026)

Persona Features Control Emergent Misalignment
by: Wang, Miles, et al.
Published: (2025)

Autonomous Deep Agent
by: Yu, Amy, et al.
Published: (2025)

The Anti-Ouroboros Effect: Emergent Resilience in Large Language Models from Recursive Selective Feedback
by: Adapala, Sai Teja Reddy
Published: (2025)

FastGRPO: Accelerating Policy Optimization via Concurrency-aware Speculative Decoding and Online Draft Learning
by: Zhang, Yizhou, et al.
Published: (2025)

Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers
by: Avinash, Mynampati Sri Ranganadha
Published: (2026)

Node-Level Uncertainty Estimation in LLM-Generated SQL
by: Hasson, Hilaf, et al.
Published: (2025)

ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing
by: Kadu, Ankush, et al.
Published: (2025)

Benefits and Limitations of Communication in Multi-Agent Reasoning
by: Rizvi-Martel, Michael, et al.
Published: (2025)

Contextual Integrity in LLMs via Reasoning and Reinforcement Learning
by: Lan, Guangchen, et al.
Published: (2025)

Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
by: Abramov, Roman, et al.
Published: (2025)

Beyond Pass@k: Breadth-Depth Metrics for Reasoning Boundaries
by: Dragoi, Marius, et al.
Published: (2025)

Counterfactual Likelihood Tests for Indirect Influence in Private Reasoning Channels
by: Lorup, Alexander Boesgaard
Published: (2026)

Cognitive Load Limits in Large Language Models: Benchmarking Multi-Hop Reasoning
by: Adapala, Sai Teja Reddy
Published: (2025)

Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
by: Xu, Shuyao, et al.
Published: (2025)

The Deterministic Horizon: When Extended Reasoning Fails and Tool Delegation Becomes Necessary
by: Guo, Dongxin, et al.
Published: (2026)

DPO Unchained: Your Training Algorithm is Secretly Disentangled in Human Choice Theory
by: Zhou, Wenxuan, et al.
Published: (2025)

The Instability of Safety: How Random Seeds and Temperature Expose Inconsistent LLM Refusal Behavior
by: Larsen, Erik
Published: (2025)

Generalizing Numerical Reasoning in Table Data through Operation Sketches and Self-Supervised Learning
by: Cho, Hanjun, et al.
Published: (2026)

Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
by: Cui, Sasha, et al.
Published: (2025)

Self-Training Doesn't Flatten Language -- It Restructures It: Surface Markers Amplify While Deep Syntax Dies
by: Liu, Ming
Published: (2026)

Alif: Advancing Urdu Large Language Models via Multilingual Synthetic Data Distillation
by: Shafique, Muhammad Ali, et al.
Published: (2025)

On the Limits of Learned Importance Scoring for KV Cache Compression
by: Steele, Brady
Published: (2026)

Domain-Specific Pretraining of Language Models: A Comparative Study in the Medical Field
by: Kerner, Tobias
Published: (2024)

The Last Word Often Wins: A Format Confound in Chain-of-Thought Corruption Studies
by: Garcia, Gabriel
Published: (2026)