Saved in:
| Main Authors: | Pasten, Hector, Urrutia, Felipe, Jimenez, Hector, Calderon, Cristian B., Rojas, Cristóbal, Kozachinskiy, Alexander |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.10606 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Decoupling Positional and Symbolic Attention Behavior in Transformers
by: Urrutia, Felipe, et al.
Published: (2025)
by: Urrutia, Felipe, et al.
Published: (2025)
Strassen Attention, Split VC Dimension and Compositionality in Transformers
by: Kozachinskiy, Alexander, et al.
Published: (2025)
by: Kozachinskiy, Alexander, et al.
Published: (2025)
Positional versus Symbolic Attention Heads: Learning Dynamics, RoPE Geometry, and Length Generalization
by: Urrutia, Felipe, et al.
Published: (2026)
by: Urrutia, Felipe, et al.
Published: (2026)
Lower bounds on transformers with infinite precision
by: Kozachinskiy, Alexander
Published: (2024)
by: Kozachinskiy, Alexander
Published: (2024)
Message Passing on the Edge: Towards Scalable and Expressive GNNs
by: Barceló, Pablo, et al.
Published: (2025)
by: Barceló, Pablo, et al.
Published: (2025)
A completely uniform transformer for parity
by: Kozachinskiy, Alexander, et al.
Published: (2025)
by: Kozachinskiy, Alexander, et al.
Published: (2025)
Simple online learning with consistent oracle
by: Kozachinskiy, Alexander, et al.
Published: (2023)
by: Kozachinskiy, Alexander, et al.
Published: (2023)
Ehrenfeucht-Haussler Rank and Chain of Thought
by: Barceló, Pablo, et al.
Published: (2025)
by: Barceló, Pablo, et al.
Published: (2025)
Parity, Sensitivity, and Transformers
by: Kozachinskiy, Alexander, et al.
Published: (2026)
by: Kozachinskiy, Alexander, et al.
Published: (2026)
On the Limits of Self-Improving in Large Language Models: The Singularity Is Not Near Without Symbolic Model Synthesis
by: Zenil, Hector
Published: (2026)
by: Zenil, Hector
Published: (2026)
On dimensionality of feature vectors in MPNNs
by: Bravo, César, et al.
Published: (2024)
by: Bravo, César, et al.
Published: (2024)
Optimal bounds for dissatisfaction in perpetual voting
by: Kozachinskiy, Alexander, et al.
Published: (2024)
by: Kozachinskiy, Alexander, et al.
Published: (2024)
Language Generation: Complexity Barriers and Implications for Learning
by: Arenas, Marcelo, et al.
Published: (2025)
by: Arenas, Marcelo, et al.
Published: (2025)
Risk-Sensitive RL for Alleviating Exploration Dilemmas in Large Language Models
by: Jiang, Yuhua, et al.
Published: (2025)
by: Jiang, Yuhua, et al.
Published: (2025)
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models
by: Zhao, Chongyang, et al.
Published: (2026)
by: Zhao, Chongyang, et al.
Published: (2026)
Explaining k-Nearest Neighbors: Abductive and Counterfactual Explanations
by: Barceló, Pablo, et al.
Published: (2025)
by: Barceló, Pablo, et al.
Published: (2025)
Shared Doubt: Zero-shot Cross-Lingual Confidence Estimation for Language Models
by: Kyriakou, Athina, et al.
Published: (2026)
by: Kyriakou, Athina, et al.
Published: (2026)
Meta-Cognitive Reinforcement Learning with Self-Doubt and Recovery
by: Zhang, Zhipeng, et al.
Published: (2026)
by: Zhang, Zhipeng, et al.
Published: (2026)
The Constitutional Controller: Doubt-Calibrated Steering of Compliant Agents
by: Kohaut, Simon, et al.
Published: (2025)
by: Kohaut, Simon, et al.
Published: (2025)
Concisely Explaining the Doubt: Minimum-Size Abductive Explanations for Linear Models with a Reject Option
by: Fernandes, Gleilson Pedro, et al.
Published: (2026)
by: Fernandes, Gleilson Pedro, et al.
Published: (2026)
Echoes of Socratic Doubt: Embracing Uncertainty in Calibrated Evidential Reinforcement Learning
by: Stutts, Alex Christopher, et al.
Published: (2024)
by: Stutts, Alex Christopher, et al.
Published: (2024)
Learning Generalized Policies for Fully Observable Non-Deterministic Planning Domains
by: Hofmann, Till, et al.
Published: (2024)
by: Hofmann, Till, et al.
Published: (2024)
Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
by: Borobia, Hector, et al.
Published: (2026)
by: Borobia, Hector, et al.
Published: (2026)
JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models
by: Dragomir, Alexandra, et al.
Published: (2026)
by: Dragomir, Alexandra, et al.
Published: (2026)
The Data Addition Dilemma
by: Shen, Judy Hanwen, et al.
Published: (2024)
by: Shen, Judy Hanwen, et al.
Published: (2024)
STABLE: Gated Continual Learning for Large Language Models
by: Hoy, William, et al.
Published: (2025)
by: Hoy, William, et al.
Published: (2025)
Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems
by: Zhang, Jusheng, et al.
Published: (2026)
by: Zhang, Jusheng, et al.
Published: (2026)
Routing-Based Continual Learning for Multimodal Large Language Models
by: Mohta, Jay, et al.
Published: (2025)
by: Mohta, Jay, et al.
Published: (2025)
Learning General Policies with Policy Gradient Methods
by: Ståhlberg, Simon, et al.
Published: (2025)
by: Ståhlberg, Simon, et al.
Published: (2025)
Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs
by: Kohler, Hector, et al.
Published: (2023)
by: Kohler, Hector, et al.
Published: (2023)
Learning More Expressive General Policies for Classical Planning Domains
by: Ståhlberg, Simon, et al.
Published: (2024)
by: Ståhlberg, Simon, et al.
Published: (2024)
The Chicken and Egg Dilemma: Co-optimizing Data and Model Configurations for LLMs
by: Chen, Zhiliang, et al.
Published: (2026)
by: Chen, Zhiliang, et al.
Published: (2026)
DeepRWCap: Neural-Guided Random-Walk Capacitance Solver for IC Design
by: Rodriguez, Hector R., et al.
Published: (2025)
by: Rodriguez, Hector R., et al.
Published: (2025)
Differentiable Learning of Lifted Action Schemas for Classical Planning
by: Reiter, Jonas, et al.
Published: (2026)
by: Reiter, Jonas, et al.
Published: (2026)
CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning
by: Zhao, Yanxiao, et al.
Published: (2025)
by: Zhao, Yanxiao, et al.
Published: (2025)
When Continue Learning Meets Multimodal Large Language Model: A Survey
by: Huo, Yukang, et al.
Published: (2025)
by: Huo, Yukang, et al.
Published: (2025)
Are Large-Language Models Graph Algorithmic Reasoners?
by: Taylor, Alexander K, et al.
Published: (2024)
by: Taylor, Alexander K, et al.
Published: (2024)
Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas: A Survey
by: Deng, Chengyuan, et al.
Published: (2024)
by: Deng, Chengyuan, et al.
Published: (2024)
COPAL: Continual Pruning in Large Language Generative Models
by: Malla, Srikanth, et al.
Published: (2024)
by: Malla, Srikanth, et al.
Published: (2024)
Multi-Agent Systems Powered by Large Language Models: Applications in Swarm Intelligence
by: Jimenez-Romero, Cristian, et al.
Published: (2025)
by: Jimenez-Romero, Cristian, et al.
Published: (2025)
Similar Items
-
Decoupling Positional and Symbolic Attention Behavior in Transformers
by: Urrutia, Felipe, et al.
Published: (2025) -
Strassen Attention, Split VC Dimension and Compositionality in Transformers
by: Kozachinskiy, Alexander, et al.
Published: (2025) -
Positional versus Symbolic Attention Heads: Learning Dynamics, RoPE Geometry, and Length Generalization
by: Urrutia, Felipe, et al.
Published: (2026) -
Lower bounds on transformers with infinite precision
by: Kozachinskiy, Alexander
Published: (2024) -
Message Passing on the Edge: Towards Scalable and Expressive GNNs
by: Barceló, Pablo, et al.
Published: (2025)