:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Abel, David, Barreto, André, Bowling, Michael, Dabney, Will, Dong, Shi, Hansen, Steven, Harutyunyan, Anna, Khetarpal, Khimya, Lyle, Clare, Pascanu, Razvan, Piliouras, Georgios, Precup, Doina, Richens, Jonathan, Rowland, Mark, Schaul, Tom, Singh, Satinder
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.04403
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Plasticity as the Mirror of Empowerment
by: Abel, David, et al.
Published: (2025)

Disentangling the Causes of Plasticity Loss in Neural Networks
by: Lyle, Clare, et al.
Published: (2024)

Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)

Affordances Enable Partial World Modeling with LLMs
by: Khetarpal, Khimya, et al.
Published: (2026)

Preventing Learning Stagnation in PPO by Scaling to 1 Million Parallel Environments
by: Beukman, Michael, et al.
Published: (2026)

Optimizing Return Distributions with Distributional Dynamic Programming
by: Pires, Bernardo Ávila, et al.
Published: (2025)

Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
by: Cherif, Lynn, et al.
Published: (2025)

A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning
by: Khetarpal, Khimya, et al.
Published: (2024)

Capacity-Constrained Continual Learning
by: Wen, Zheng, et al.
Published: (2025)

What Can Grokking Teach Us About Learning Under Nonstationarity?
by: Lyle, Clare, et al.
Published: (2025)

Robust Intervention Learning from Emergency Stop Interventions
by: Pronovost, Ethan, et al.
Published: (2026)

Boundless Socratic Learning with Language Games
by: Schaul, Tom
Published: (2024)

Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model
by: Rowland, Mark, et al.
Published: (2024)

Diversity-Enriched Option-Critic
by: Kamat, Anand, et al.
Published: (2020)

Functional Acceleration for Policy Mirror Descent
by: Chelu, Veronica, et al.
Published: (2024)

A Look at Value-Based Decision-Time vs. Background Planning Methods Across Different Settings
by: Alver, Safa, et al.
Published: (2022)

Self-Predictive Representations for Combinatorial Generalization in Behavioral Cloning
by: Lawson, Daniel, et al.
Published: (2025)

Capturing Individual Human Preferences with Reward Features
by: Barreto, André, et al.
Published: (2025)

General agents contain world models
by: Richens, Jonathan, et al.
Published: (2025)

Fine-Tuned In-Context Learners for Efficient Adaptation
by: Bornschein, Jorg, et al.
Published: (2025)

Robust agents learn causal world models
by: Richens, Jonathan, et al.
Published: (2024)

Representation Learning via Non-Contrastive Mutual Information
by: Guo, Zhaohan Daniel, et al.
Published: (2025)

Balancing Plasticity and Stability with Fast and Slow Successor Features
by: Chua, Raymond, et al.
Published: (2026)

On the Privacy of Selection Mechanisms with Gaussian Noise
by: Lebensold, Jonathan, et al.
Published: (2024)

Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset
by: Galashov, Alexandre, et al.
Published: (2024)

A Distributional Analogue to the Successor Representation
by: Wiltzer, Harley, et al.
Published: (2024)

Conditions on Preference Relations that Guarantee the Existence of Optimal Policies
by: Carr, Jonathan Colaço, et al.
Published: (2023)

Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
by: Arnob, Samin Yeasar, et al.
Published: (2025)

Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)

Fluid-Agent Reinforcement Learning
by: Sharma, Shishir, et al.
Published: (2026)

Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
by: Alver, Safa, et al.
Published: (2024)

Lattice: Learning to Efficiently Compress the Memory
by: Karami, Mahdi, et al.
Published: (2025)

Deep Grokking: Would Deep Neural Networks Generalize Better?
by: Fan, Simin, et al.
Published: (2024)

Toward Human-AI Alignment in Large-Scale Multi-Player Games
by: Sharma, Sugandha, et al.
Published: (2024)

The Limits of Predicting Agents from Behaviour
by: Bellot, Alexis, et al.
Published: (2025)

An Analysis of Quantile Temporal-Difference Learning
by: Rowland, Mark, et al.
Published: (2023)

Parseval Regularization for Continual Reinforcement Learning
by: Chung, Wesley, et al.
Published: (2024)

Relative Trajectory Balance is equivalent to Trust-PCL
by: Deleu, Tristan, et al.
Published: (2025)

NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation
by: Li, Qinyu, et al.
Published: (2025)

Meta-learning how to Share Credit among Macro-Actions
by: Hosu, Ionel-Alexandru, et al.
Published: (2025)