:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Traylor, Aaron, Merullo, Jack, Frank, Michael J., Pavlick, Ellie
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence I.2.6
Online Access:	https://arxiv.org/abs/2402.08211
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Deep Memory Search: A Metaheuristic Approach for Optimizing Heuristic Search
by: Hedar, Abdel-Rahman, et al.
Published: (2024)

Multi-Task Reinforcement Learning with Language-Encoded Gated Policy Networks
by: Arora, Rushiv
Published: (2025)

When to Forget: A Memory Governance Primitive
by: Simsek, Baris
Published: (2026)

Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2024)

Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers
by: Avinash, Mynampati Sri Ranganadha
Published: (2026)

Learning Through Noise: Why Subliminal Learning Works and When It Fails
by: Brockers, Vincent C., et al.
Published: (2026)

Graph Memory Transformer (GMT)
by: Zanarini, Nicola, et al.
Published: (2026)

Low-Dimensional Execution Manifolds in Transformer Learning Dynamics: Evidence from Modular Arithmetic Tasks
by: Xu, Yongzhong
Published: (2026)

Learning Alternative Ways of Performing a Task
by: Nieves, David, et al.
Published: (2024)

Entropy-Based Measurement of Value Drift and Alignment Work in Large Language Models
by: Fadli, Samih
Published: (2025)

Comprehensive Metapath-based Heterogeneous Graph Transformer for Gene-Disease Association Prediction
by: Cui, Wentao, et al.
Published: (2025)

MeMo: Towards Language Models with Associative Memory Mechanisms
by: Zanzotto, Fabio Massimo, et al.
Published: (2025)

A Domain-Independent Agent Architecture for Adaptive Operation in Evolving Open Worlds
by: Mohan, Shiwali, et al.
Published: (2023)

When Can Human-AI Teams Outperform Individuals? Tight Bounds with Impossibility Guarantees
by: Guo, Dongxin, et al.
Published: (2026)

Task Memory Engine: Spatial Memory for Robust Multi-Step LLM Agents
by: Ye, Ye
Published: (2025)

Residual Reservoir Memory Networks
by: Pinna, Matteo, et al.
Published: (2025)

ARCTraj: A Dataset and Benchmark of Human Reasoning Trajectories for Abstract Problem Solving
by: Kim, Sejin, et al.
Published: (2025)

Training Language Models to Win Debates with Self-Play Improves Judge Accuracy
by: Arnesen, Samuel, et al.
Published: (2024)

Automated CAD Modeling Sequence Generation from Text Descriptions via Transformer-Based Large Language Models
by: Liao, Jianxing, et al.
Published: (2025)

Teacher-Student Guided Inverse Modeling for Steel Final Hardness Estimation
by: Alsheikh, Ahmad, et al.
Published: (2025)

ATANT v1.1: Positioning Continuity Evaluation Against Memory, Long-Context, and Agentic-Memory Benchmarks
by: Tanguturi, Samuel Sameer
Published: (2026)

LLM Performance Predictors: Learning When to Escalate in Hybrid Human-AI Moderation Systems
by: Bachar, Or, et al.
Published: (2026)

When Agents Disagree: The Selection Bottleneck in Multi-Agent LLM Pipelines
by: Maryanskyy, Artem
Published: (2026)

Universal Transformers Need Memory: Depth-State Trade-offs in Adaptive Recursive Reasoning
by: Sapunov, Grigory
Published: (2026)

ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing
by: Kadu, Ankush, et al.
Published: (2025)

Advancing Multimodal Agent Reasoning with Long-Term Neuro-Symbolic Memory
by: Jiang, Rongjie, et al.
Published: (2026)

When Actions Disappear: Adversarial Action Removal in Self-Play Reinforcement Learning
by: Kujur, Arahan
Published: (2026)

Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction
by: Kohlberger, Björn Roman
Published: (2026)

Sketch Decompositions for Classical Planning via Deep Reinforcement Learning
by: Aichmüller, Michael, et al.
Published: (2024)

CRAFT: Clustered Regression for Adaptive Filtering of Training data
by: Panda, Parthasarathi, et al.
Published: (2026)

Training Artificial Neural Networks by Coordinate Search Algorithm
by: Rokhsatyazdi, Ehsan, et al.
Published: (2024)

DPO Unchained: Your Training Algorithm is Secretly Disentangled in Human Choice Theory
by: Zhou, Wenxuan, et al.
Published: (2025)

Less is More: Learning Graph Tasks with Just LLMs
by: Shirai, Sola, et al.
Published: (2025)

Large Language Models as Attribution Regularizers for Efficient Model Training
by: Vukadin, Davor, et al.
Published: (2025)

Score-informed Neural Operator for Enhancing Ordering-based Causal Discovery
by: Kang, Jiyeon, et al.
Published: (2025)

Working Paper: Active Causal Structure Learning with Latent Variables: Towards Learning to Detour in Autonomous Robots
by: Riscos, Pablo de los, et al.
Published: (2024)

Plug-and-Play Spiking Operators: Breaking the Nonlinearity Bottleneck in Spiking Transformers
by: Yuan, Xinzhe, et al.
Published: (2026)

Truth as a Compression Artifact in Language Model Training
by: Krestnikov, Konstantin
Published: (2026)

Let's Think Dot by Dot: Hidden Computation in Transformer Language Models
by: Pfau, Jacob, et al.
Published: (2024)

Architectural Proprioception in State Space Models: Thermodynamic Training Induces Anticipatory Halt Detection
by: Noon, Jay
Published: (2026)