:: Library Catalog

Bibliographic Details
Main Authors:	Pasten, Hector; Urrutia, Felipe; Jimenez, Hector; Calderon, Cristian B.; Rojas, Cristóbal; Kozachinskiy, Alexander
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.10606

Similar Items

Decoupling Positional and Symbolic Attention Behavior in Transformers
by: Urrutia, Felipe, et al.
Published: (2025)

Strassen Attention, Split VC Dimension and Compositionality in Transformers
by: Kozachinskiy, Alexander, et al.
Published: (2025)

Positional versus Symbolic Attention Heads: Learning Dynamics, RoPE Geometry, and Length Generalization
by: Urrutia, Felipe, et al.
Published: (2026)

Lower bounds on transformers with infinite precision
by: Kozachinskiy, Alexander
Published: (2024)

Message Passing on the Edge: Towards Scalable and Expressive GNNs
by: Barceló, Pablo, et al.
Published: (2025)

A completely uniform transformer for parity
by: Kozachinskiy, Alexander, et al.
Published: (2025)

Simple online learning with consistent oracle
by: Kozachinskiy, Alexander, et al.
Published: (2023)

Ehrenfeucht-Haussler Rank and Chain of Thought
by: Barceló, Pablo, et al.
Published: (2025)

Parity, Sensitivity, and Transformers
by: Kozachinskiy, Alexander, et al.
Published: (2026)

On the Limits of Self-Improving in Large Language Models: The Singularity Is Not Near Without Symbolic Model Synthesis
by: Zenil, Hector
Published: (2026)

On dimensionality of feature vectors in MPNNs
by: Bravo, César, et al.
Published: (2024)

Optimal bounds for dissatisfaction in perpetual voting
by: Kozachinskiy, Alexander, et al.
Published: (2024)

Language Generation: Complexity Barriers and Implications for Learning
by: Arenas, Marcelo, et al.
Published: (2025)

Risk-Sensitive RL for Alleviating Exploration Dilemmas in Large Language Models
by: Jiang, Yuhua, et al.
Published: (2025)

On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models
by: Zhao, Chongyang, et al.
Published: (2026)

Explaining k-Nearest Neighbors: Abductive and Counterfactual Explanations
by: Barceló, Pablo, et al.
Published: (2025)

Shared Doubt: Zero-shot Cross-Lingual Confidence Estimation for Language Models
by: Kyriakou, Athina, et al.
Published: (2026)

Meta-Cognitive Reinforcement Learning with Self-Doubt and Recovery
by: Zhang, Zhipeng, et al.
Published: (2026)

The Constitutional Controller: Doubt-Calibrated Steering of Compliant Agents
by: Kohaut, Simon, et al.
Published: (2025)

Concisely Explaining the Doubt: Minimum-Size Abductive Explanations for Linear Models with a Reject Option
by: Fernandes, Gleilson Pedro, et al.
Published: (2026)

Echoes of Socratic Doubt: Embracing Uncertainty in Calibrated Evidential Reinforcement Learning
by: Stutts, Alex Christopher, et al.
Published: (2024)

Learning Generalized Policies for Fully Observable Non-Deterministic Planning Domains
by: Hofmann, Till, et al.
Published: (2024)

Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures
by: Borobia, Hector, et al.
Published: (2026)

JumpLoRA: Sparse Adapters for Continual Learning in Large Language Models
by: Dragomir, Alexandra, et al.
Published: (2026)

The Data Addition Dilemma
by: Shen, Judy Hanwen, et al.
Published: (2024)

STABLE: Gated Continual Learning for Large Language Models
by: Hoy, William, et al.
Published: (2025)

Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems
by: Zhang, Jusheng, et al.
Published: (2026)

Routing-Based Continual Learning for Multimodal Large Language Models
by: Mohta, Jay, et al.
Published: (2025)

Learning General Policies with Policy Gradient Methods
by: Ståhlberg, Simon, et al.
Published: (2025)

Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs
by: Kohler, Hector, et al.
Published: (2023)

Learning More Expressive General Policies for Classical Planning Domains
by: Ståhlberg, Simon, et al.
Published: (2024)

The Chicken and Egg Dilemma: Co-optimizing Data and Model Configurations for LLMs
by: Chen, Zhiliang, et al.
Published: (2026)

DeepRWCap: Neural-Guided Random-Walk Capacitance Solver for IC Design
by: Rodriguez, Hector R., et al.
Published: (2025)

Differentiable Learning of Lifted Action Schemas for Classical Planning
by: Reiter, Jonas, et al.
Published: (2026)

CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning
by: Zhao, Yanxiao, et al.
Published: (2025)

When Continue Learning Meets Multimodal Large Language Model: A Survey
by: Huo, Yukang, et al.
Published: (2025)

Are Large-Language Models Graph Algorithmic Reasoners?
by: Taylor, Alexander K, et al.
Published: (2024)

Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas: A Survey
by: Deng, Chengyuan, et al.
Published: (2024)

COPAL: Continual Pruning in Large Language Generative Models
by: Malla, Srikanth, et al.
Published: (2024)

Multi-Agent Systems Powered by Large Language Models: Applications in Swarm Intelligence
by: Jimenez-Romero, Cristian, et al.
Published: (2025)