Saved in:
| Main Author: | Sapunov, Grigory |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.21999 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Graph Memory Transformer (GMT)
by: Zanarini, Nicola, et al.
Published: (2026)
by: Zanarini, Nicola, et al.
Published: (2026)
Beyond Pass@k: Breadth-Depth Metrics for Reasoning Boundaries
by: Dragoi, Marius, et al.
Published: (2025)
by: Dragoi, Marius, et al.
Published: (2025)
The Mirror Loop: Recursive Non-Convergence in Generative Reasoning Systems
by: DeVilling, Bentley
Published: (2025)
by: DeVilling, Bentley
Published: (2025)
Forget Attention: Importance-Aware Attention Is All You Need
by: Shin, Soohyeong, et al.
Published: (2026)
by: Shin, Soohyeong, et al.
Published: (2026)
Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo
by: Gadzhiev, Artem, et al.
Published: (2026)
by: Gadzhiev, Artem, et al.
Published: (2026)
Contextual Integrity in LLMs via Reasoning and Reinforcement Learning
by: Lan, Guangchen, et al.
Published: (2025)
by: Lan, Guangchen, et al.
Published: (2025)
Counterfactual Likelihood Tests for Indirect Influence in Private Reasoning Channels
by: Lorup, Alexander Boesgaard
Published: (2026)
by: Lorup, Alexander Boesgaard
Published: (2026)
No Free Swap: Protocol-Dependent Layer Redundancy in Transformers
by: Garcia, Gabriel
Published: (2026)
by: Garcia, Gabriel
Published: (2026)
Weakly Supervised Distillation of Hallucination Signals into Transformer Representations
by: Salehmohamed, Shoaib Sadiq, et al.
Published: (2026)
by: Salehmohamed, Shoaib Sadiq, et al.
Published: (2026)
Prototype Transformer: Towards Language Model Architectures Interpretable by Design
by: Yordanov, Yordan, et al.
Published: (2026)
by: Yordanov, Yordan, et al.
Published: (2026)
The Deterministic Horizon: When Extended Reasoning Fails and Tool Delegation Becomes Necessary
by: Guo, Dongxin, et al.
Published: (2026)
by: Guo, Dongxin, et al.
Published: (2026)
Cognitive Load Limits in Large Language Models: Benchmarking Multi-Hop Reasoning
by: Adapala, Sai Teja Reddy
Published: (2025)
by: Adapala, Sai Teja Reddy
Published: (2025)
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
by: Xu, Shuyao, et al.
Published: (2025)
by: Xu, Shuyao, et al.
Published: (2025)
Generalizing Numerical Reasoning in Table Data through Operation Sketches and Self-Supervised Learning
by: Cho, Hanjun, et al.
Published: (2026)
by: Cho, Hanjun, et al.
Published: (2026)
Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates
by: Kaplanski, Pawel
Published: (2026)
by: Kaplanski, Pawel
Published: (2026)
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers
by: Abramov, Roman, et al.
Published: (2025)
by: Abramov, Roman, et al.
Published: (2025)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
by: Fadli, Samih
Published: (2025)
Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)
by: Hedar, Abdel-Rahman, et al.
Published: (2024)
Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models
by: Yocam, Eric, et al.
Published: (2026)
by: Yocam, Eric, et al.
Published: (2026)
Cross-Entropy Is Load-Bearing: A Pre-Registered Scope Test of the K-Way Energy Probe on Bidirectional Predictive Coding
by: Cacioli, Jon-Paul
Published: (2026)
by: Cacioli, Jon-Paul
Published: (2026)
Towards Understanding Sycophancy in Language Models
by: Sharma, Mrinank, et al.
Published: (2023)
by: Sharma, Mrinank, et al.
Published: (2023)
Characterizing Pattern Matching and Its Limits on Compositional Task Structures
by: Chang, Hoyeon, et al.
Published: (2025)
by: Chang, Hoyeon, et al.
Published: (2025)
Dynamic Policy Induction for Adaptive Prompt Optimization: Bridging the Efficiency-Accuracy Gap via Lightweight Reinforcement Learning
by: Xu, Jiexi
Published: (2025)
by: Xu, Jiexi
Published: (2025)
Continuous-Depth Transformers with Learned Control Dynamics
by: Jemley, Peter
Published: (2026)
by: Jemley, Peter
Published: (2026)
Ouroboros: Dynamic Weight Generation for Recursive Transformers via Input-Conditioned LoRA Modulation
by: Jaber, Jaber, et al.
Published: (2026)
by: Jaber, Jaber, et al.
Published: (2026)
Control Reinforcement Learning: Interpretable Token-Level Steering of LLMs via Sparse Autoencoder Features
by: Cho, Seonglae, et al.
Published: (2026)
by: Cho, Seonglae, et al.
Published: (2026)
In-Context Fixation: When Demonstrated Labels Override Semantics in Few-Shot Classification
by: Liu, Ming
Published: (2026)
by: Liu, Ming
Published: (2026)
The Last Word Often Wins: A Format Confound in Chain-of-Thought Corruption Studies
by: Garcia, Gabriel
Published: (2026)
by: Garcia, Gabriel
Published: (2026)
TIAR: Trajectory-Informed Advantage Reweighting for LLM Abstention Learning
by: Pan, Muyu, et al.
Published: (2026)
by: Pan, Muyu, et al.
Published: (2026)
Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations
by: Kumar, Sachin
Published: (2026)
by: Kumar, Sachin
Published: (2026)
Structured Prompt Optimization Meets Reinforcement Learning for Global and Local Interpretability over Complex Text
by: Zhou, Tianyang, et al.
Published: (2026)
by: Zhou, Tianyang, et al.
Published: (2026)
AMEL: Accumulated Message Effects on LLM Judgments
by: Temkit, Sid-Ali
Published: (2026)
by: Temkit, Sid-Ali
Published: (2026)
Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models
by: Zhang, Gongbo, et al.
Published: (2026)
by: Zhang, Gongbo, et al.
Published: (2026)
Model Collapse as Cultural Evolution
by: Guo, Dongxin, et al.
Published: (2026)
by: Guo, Dongxin, et al.
Published: (2026)
Alternating Reinforcement Learning with Contextual Rubric Rewards: Beyond the Scalarization Strategy
by: Lan, Guangchen, et al.
Published: (2026)
by: Lan, Guangchen, et al.
Published: (2026)
Self-Training Doesn't Flatten Language -- It Restructures It: Surface Markers Amplify While Deep Syntax Dies
by: Liu, Ming
Published: (2026)
by: Liu, Ming
Published: (2026)
The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models
by: Liu, Ming
Published: (2026)
by: Liu, Ming
Published: (2026)
Revisiting Intermediate-Layer Matching in Knowledge Distillation: Layer-Selection Strategy Doesn't Matter (Much)
by: Yu, Zony, et al.
Published: (2025)
by: Yu, Zony, et al.
Published: (2025)
Painless Activation Steering: An Automated, Lightweight Approach for Post-Training Large Language Models
by: Cui, Sasha, et al.
Published: (2025)
by: Cui, Sasha, et al.
Published: (2025)
Enhancing Burmese News Classification with Kolmogorov-Arnold Network Head Fine-tuning
by: Aung, Thura, et al.
Published: (2025)
by: Aung, Thura, et al.
Published: (2025)
Similar Items
-
Graph Memory Transformer (GMT)
by: Zanarini, Nicola, et al.
Published: (2026) -
Beyond Pass@k: Breadth-Depth Metrics for Reasoning Boundaries
by: Dragoi, Marius, et al.
Published: (2025) -
The Mirror Loop: Recursive Non-Convergence in Generative Reasoning Systems
by: DeVilling, Bentley
Published: (2025) -
Forget Attention: Importance-Aware Attention Is All You Need
by: Shin, Soohyeong, et al.
Published: (2026) -
Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo
by: Gadzhiev, Artem, et al.
Published: (2026)