| Main Authors: | Traylor, Aaron, Merullo, Jack, Frank, Michael J., Pavlick, Ellie |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.08211 |
Similar Items
Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)
Multi-Task Reinforcement Learning with Language-Encoded Gated Policy Networks
by: Arora, Rushiv
Published: (2025)
When to Forget: A Memory Governance Primitive
by: Simsek, Baris
Published: (2026)
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2024)
Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers
by: Avinash, Mynampati Sri Ranganadha
Published: (2026)
Learning Through Noise: Why Subliminal Learning Works and When It Fails
by: Brockers, Vincent C., et al.
Published: (2026)
Graph Memory Transformer (GMT)
by: Zanarini, Nicola, et al.
Published: (2026)
Low-Dimensional Execution Manifolds in Transformer Learning Dynamics: Evidence from Modular Arithmetic Tasks
by: Xu, Yongzhong
Published: (2026)
Learning Alternative Ways of Performing a Task
by: Nieves, David, et al.
Published: (2024)
Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)
Comprehensive Metapath-based Heterogeneous Graph Transformer for Gene-Disease Association Prediction
by: Cui, Wentao, et al.
Published: (2025)
MeMo: Towards Language Models with Associative Memory Mechanisms
by: Zanzotto, Fabio Massimo, et al.
Published: (2025)
A Domain-Independent Agent Architecture for Adaptive Operation in Evolving Open Worlds
by: Mohan, Shiwali, et al.
Published: (2023)
When Can Human-AI Teams Outperform Individuals? Tight Bounds with Impossibility Guarantees
by: Guo, Dongxin, et al.
Published: (2026)
Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents
by: Ye, Ye
Published: (2025)
Residual Reservoir Memory Networks
by: Pinna, Matteo, et al.
Published: (2025)
ARCTraj: A Dataset and Benchmark of Human Reasoning Trajectories for Abstract Problem Solving
by: Kim, Sejin, et al.
Published: (2025)
Training Language Models to Win Debates with Self-Play Improves Judge Accuracy
by: Arnesen, Samuel, et al.
Published: (2024)
Automated CAD Modeling Sequence Generation from Text Descriptions via Transformer-Based Large Language Models
by: Liao, Jianxing, et al.
Published: (2025)
Teacher-Student Guided Inverse Modeling for Steel Final Hardness Estimation
by: Alsheikh, Ahmad, et al.
Published: (2025)
ATANT v1.1: Positioning Continuity Evaluation Against Memory, Long-Context, and Agentic-Memory Benchmarks
by: Tanguturi, Samuel Sameer
Published: (2026)
LLM Performance Predictors: Learning When to Escalate in Hybrid Human-AI Moderation Systems
by: Bachar, Or, et al.
Published: (2026)
When Agents Disagree: The Selection Bottleneck in Multi-Agent LLM Pipelines
by: Maryanskyy, Artem
Published: (2026)
Universal Transformers Need Memory: Depth-State Trade-offs in Adaptive Recursive Reasoning
by: Sapunov, Grigory
Published: (2026)
ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing
by: Kadu, Ankush, et al.
Published: (2025)
Advancing Multimodal Agent Reasoning with Long-Term Neuro-Symbolic Memory
by: Jiang, Rongjie, et al.
Published: (2026)
When Actions Disappear: Adversarial Action Removal in Self-Play Reinforcement Learning
by: Kujur, Arahan
Published: (2026)
Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction
by: Kohlberger, Björn Roman
Published: (2026)
Sketch Decompositions for Classical Planning via Deep Reinforcement Learning
by: Aichmüller, Michael, et al.
Published: (2024)
CRAFT: Clustered Regression for Adaptive Filtering of Training data
by: Panda, Parthasarathi, et al.
Published: (2026)
Training Artificial Neural Networks by Coordinate Search Algorithm
by: Rokhsatyazdi, Ehsan, et al.
Published: (2024)
DPO Unchained: Your Training Algorithm is Secretly Disentangled in Human Choice Theory
by: Zhou, Wenxuan, et al.
Published: (2025)
Less is More: Learning Graph Tasks with Just LLMs
by: Shirai, Sola, et al.
Published: (2025)
Large Language Models as Attribution Regularizers for Efficient Model Training
by: Vukadin, Davor, et al.
Published: (2025)
Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery
by: Kang, Jiyeon, et al.
Published: (2025)
Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Detour in Autonomous Robots
by: Riscos, Pablo de los, et al.
Published: (2024)
Plug-and-Play Spiking Operators: Breaking the Nonlinearity Bottleneck in Spiking Transformers
by: Yuan, Xinzhe, et al.
Published: (2026)
Truth as a Compression Artifact in Language Model Training
by: Krestnikov, Konstantin
Published: (2026)
Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
by: Pfau, Jacob, et al.
Published: (2024)
Architectural Proprioception in State Space Models: Thermodynamic Training Induces Anticipatory Halt Detection
by: Noon, Jay
Published: (2026)