Saved in:
| Main Authors: | Wang, Fuxin, Alazali, Amr, Zhong, Yiqiao |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01017 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Shattered Compositionality: Counterintuitive Learning Dynamics of Transformers for Arithmetic
by: Zhao, Xingyu, et al.
Published: (2026)
by: Zhao, Xingyu, et al.
Published: (2026)
The Geometry of Thought: How Scale Restructures Reasoning In Large Language Models
by: Anderson, Samuel Cyrenius
Published: (2026)
by: Anderson, Samuel Cyrenius
Published: (2026)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
by: Fadli, Samih
Published: (2025)
Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs
by: Eisenstadt, Roy, et al.
Published: (2025)
by: Eisenstadt, Roy, et al.
Published: (2025)
CircuitProbe: Predicting Reasoning Circuits in Transformers via Stability Zone Detection
by: Panuganti, Rajkiran
Published: (2026)
by: Panuganti, Rajkiran
Published: (2026)
On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning
by: Deshmukh, Pratik, et al.
Published: (2026)
by: Deshmukh, Pratik, et al.
Published: (2026)
Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features
by: Heyman, Alex, et al.
Published: (2025)
by: Heyman, Alex, et al.
Published: (2025)
SCULPT: Constraint-Guided Pruned MCTS that Carves Efficient Paths for Mathematical Reasoning
by: Fang, Qitong, et al.
Published: (2026)
by: Fang, Qitong, et al.
Published: (2026)
Scaling Trends for Multi-Hop Contextual Reasoning in Mid-Scale Language Models
by: Steele, Brady, et al.
Published: (2026)
by: Steele, Brady, et al.
Published: (2026)
Social Cooperation in Conversational AI Agents
by: Çelikok, Mustafa Mert, et al.
Published: (2025)
by: Çelikok, Mustafa Mert, et al.
Published: (2025)
FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning
by: Yuan, Xin, et al.
Published: (2025)
by: Yuan, Xin, et al.
Published: (2025)
Extreme AutoML: Analysis of Classification, Regression, and NLP Performance
by: Ratner, Edward, et al.
Published: (2024)
by: Ratner, Edward, et al.
Published: (2024)
PRPO: Aligning Process Reward with Outcome Reward in Policy Optimization
by: Ding, Ruiyi, et al.
Published: (2026)
by: Ding, Ruiyi, et al.
Published: (2026)
Quantization Undoes Alignment: Bias Emergence in Compressed LLMs Across Models and Precision Levels
by: Rath, Plawan Kumar, et al.
Published: (2026)
by: Rath, Plawan Kumar, et al.
Published: (2026)
OFMU: Optimization-Driven Framework for Machine Unlearning
by: Asif, Sadia, et al.
Published: (2025)
by: Asif, Sadia, et al.
Published: (2025)
BLOCK-EM: Preventing Emergent Misalignment via Latent Blocking
by: Ustaomeroglu, Muhammed, et al.
Published: (2026)
by: Ustaomeroglu, Muhammed, et al.
Published: (2026)
Persona Features Control Emergent Misalignment
by: Wang, Miles, et al.
Published: (2025)
by: Wang, Miles, et al.
Published: (2025)
Autonomous Deep Agent
by: Yu, Amy, et al.
Published: (2025)
by: Yu, Amy, et al.
Published: (2025)
The Anti-Ouroboros Effect: Emergent Resilience in Large Language Models from Recursive Selective Feedback
by: Adapala, Sai Teja Reddy
Published: (2025)
by: Adapala, Sai Teja Reddy
Published: (2025)
FastGRPO: Accelerating Policy Optimization via Concurrency-aware Speculative Decoding and Online Draft Learning
by: Zhang, Yizhou, et al.
Published: (2025)
by: Zhang, Yizhou, et al.
Published: (2025)
Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers
by: Avinash, Mynampati Sri Ranganadha
Published: (2026)
by: Avinash, Mynampati Sri Ranganadha
Published: (2026)
Node-Level Uncertainty Estimation in LLM-Generated SQL
by: Hasson, Hilaf, et al.
Published: (2025)
by: Hasson, Hilaf, et al.
Published: (2025)
ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing
by: Kadu, Ankush, et al.
Published: (2025)
by: Kadu, Ankush, et al.
Published: (2025)
Benefits and Limitations of Communication in Multi-Agent Reasoning
by: Rizvi-Martel, Michael, et al.
Published: (2025)
by: Rizvi-Martel, Michael, et al.
Published: (2025)
Contextual Integrity in LLMs via Reasoning and Reinforcement Learning
by: Lan, Guangchen, et al.
Published: (2025)
by: Lan, Guangchen, et al.
Published: (2025)
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
by: Abramov, Roman, et al.
Published: (2025)
by: Abramov, Roman, et al.
Published: (2025)
Beyond Pass@k: Breadth-Depth Metrics for Reasoning Boundaries
by: Dragoi, Marius, et al.
Published: (2025)
by: Dragoi, Marius, et al.
Published: (2025)
Counterfactual Likelihood Tests for Indirect Influence in Private Reasoning Channels
by: Lorup, Alexander Boesgaard
Published: (2026)
by: Lorup, Alexander Boesgaard
Published: (2026)
Cognitive Load Limits in Large Language Models: Benchmarking Multi-Hop Reasoning
by: Adapala, Sai Teja Reddy
Published: (2025)
by: Adapala, Sai Teja Reddy
Published: (2025)
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
by: Xu, Shuyao, et al.
Published: (2025)
by: Xu, Shuyao, et al.
Published: (2025)
The Deterministic Horizon: When Extended Reasoning Fails and Tool Delegation Becomes Necessary
by: Guo, Dongxin, et al.
Published: (2026)
by: Guo, Dongxin, et al.
Published: (2026)
DPO Unchained: Your Training Algorithm is Secretly Disentangled in Human Choice Theory
by: Zhou, Wenxuan, et al.
Published: (2025)
by: Zhou, Wenxuan, et al.
Published: (2025)
The Instability of Safety: How Random Seeds and Temperature Expose Inconsistent LLM Refusal Behavior
by: Larsen, Erik
Published: (2025)
by: Larsen, Erik
Published: (2025)
Generalizing Numerical Reasoning in Table Data through Operation Sketches and Self-Supervised Learning
by: Cho, Hanjun, et al.
Published: (2026)
by: Cho, Hanjun, et al.
Published: (2026)
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
by: Cui, Sasha, et al.
Published: (2025)
by: Cui, Sasha, et al.
Published: (2025)
Self-Training Doesn't Flatten Language -- It Restructures It: Surface Markers Amplify While Deep Syntax Dies
by: Liu, Ming
Published: (2026)
by: Liu, Ming
Published: (2026)
Alif: Advancing Urdu Large Language Models via Multilingual Synthetic Data Distillation
by: Shafique, Muhammad Ali, et al.
Published: (2025)
by: Shafique, Muhammad Ali, et al.
Published: (2025)
On the Limits of Learned Importance Scoring for KV Cache Compression
by: Steele, Brady
Published: (2026)
by: Steele, Brady
Published: (2026)
Domain-Specific Pretraining of Language Models: A Comparative Study in the Medical Field
by: Kerner, Tobias
Published: (2024)
by: Kerner, Tobias
Published: (2024)
The Last Word Often Wins: A Format Confound in Chain-of-Thought Corruption Studies
by: Garcia, Gabriel
Published: (2026)
by: Garcia, Gabriel
Published: (2026)
Similar Items
-
Shattered Compositionality: Counterintuitive Learning Dynamics of Transformers for Arithmetic
by: Zhao, Xingyu, et al.
Published: (2026) -
The Geometry of Thought: How Scale Restructures Reasoning In Large Language Models
by: Anderson, Samuel Cyrenius
Published: (2026) -
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025) -
Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs
by: Eisenstadt, Roy, et al.
Published: (2025) -
CircuitProbe: Predicting Reasoning Circuits in Transformers via Stability Zone Detection
by: Panuganti, Rajkiran
Published: (2026)