Saved in:
| Main Authors: | Abel, David, Barreto, André, Bowling, Michael, Dabney, Will, Dong, Shi, Hansen, Steven, Harutyunyan, Anna, Khetarpal, Khimya, Lyle, Clare, Pascanu, Razvan, Piliouras, Georgios, Precup, Doina, Richens, Jonathan, Rowland, Mark, Schaul, Tom, Singh, Satinder |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.04403 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Plasticity as the Mirror of Empowerment
by: Abel, David, et al.
Published: (2025)
by: Abel, David, et al.
Published: (2025)
Disentangling the Causes of Plasticity Loss in Neural Networks
by: Lyle, Clare, et al.
Published: (2024)
by: Lyle, Clare, et al.
Published: (2024)
Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)
by: Lyle, Clare, et al.
Published: (2024)
Affordances Enable Partial World Modeling with LLMs
by: Khetarpal, Khimya, et al.
Published: (2026)
by: Khetarpal, Khimya, et al.
Published: (2026)
Preventing Learning Stagnation in PPO by Scaling to 1 Million Parallel Environments
by: Beukman, Michael, et al.
Published: (2026)
by: Beukman, Michael, et al.
Published: (2026)
Optimizing Return Distributions with Distributional Dynamic Programming
by: Pires, Bernardo Ávila, et al.
Published: (2025)
by: Pires, Bernardo Ávila, et al.
Published: (2025)
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
by: Cherif, Lynn, et al.
Published: (2025)
by: Cherif, Lynn, et al.
Published: (2025)
A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning
by: Khetarpal, Khimya, et al.
Published: (2024)
by: Khetarpal, Khimya, et al.
Published: (2024)
Capacity-Constrained Continual Learning
by: Wen, Zheng, et al.
Published: (2025)
by: Wen, Zheng, et al.
Published: (2025)
What Can Grokking Teach Us About Learning Under Nonstationarity?
by: Lyle, Clare, et al.
Published: (2025)
by: Lyle, Clare, et al.
Published: (2025)
Robust Intervention Learning from Emergency Stop Interventions
by: Pronovost, Ethan, et al.
Published: (2026)
by: Pronovost, Ethan, et al.
Published: (2026)
Boundless Socratic Learning with Language Games
by: Schaul, Tom
Published: (2024)
by: Schaul, Tom
Published: (2024)
Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model
by: Rowland, Mark, et al.
Published: (2024)
by: Rowland, Mark, et al.
Published: (2024)
Diversity-Enriched Option-Critic
by: Kamat, Anand, et al.
Published: (2020)
by: Kamat, Anand, et al.
Published: (2020)
Functional Acceleration for Policy Mirror Descent
by: Chelu, Veronica, et al.
Published: (2024)
by: Chelu, Veronica, et al.
Published: (2024)
A Look at Value-Based Decision-Time vs. Background Planning Methods Across Different Settings
by: Alver, Safa, et al.
Published: (2022)
by: Alver, Safa, et al.
Published: (2022)
Self-Predictive Representations for Combinatorial Generalization in Behavioral Cloning
by: Lawson, Daniel, et al.
Published: (2025)
by: Lawson, Daniel, et al.
Published: (2025)
Capturing Individual Human Preferences with Reward Features
by: Barreto, André, et al.
Published: (2025)
by: Barreto, André, et al.
Published: (2025)
General agents contain world models
by: Richens, Jonathan, et al.
Published: (2025)
by: Richens, Jonathan, et al.
Published: (2025)
Fine-Tuned In-Context Learners for Efficient Adaptation
by: Bornschein, Jorg, et al.
Published: (2025)
by: Bornschein, Jorg, et al.
Published: (2025)
Robust agents learn causal world models
by: Richens, Jonathan, et al.
Published: (2024)
by: Richens, Jonathan, et al.
Published: (2024)
Representation Learning via Non-Contrastive Mutual Information
by: Guo, Zhaohan Daniel, et al.
Published: (2025)
by: Guo, Zhaohan Daniel, et al.
Published: (2025)
Balancing Plasticity and Stability with Fast and Slow Successor Features
by: Chua, Raymond, et al.
Published: (2026)
by: Chua, Raymond, et al.
Published: (2026)
On the Privacy of Selection Mechanisms with Gaussian Noise
by: Lebensold, Jonathan, et al.
Published: (2024)
by: Lebensold, Jonathan, et al.
Published: (2024)
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset
by: Galashov, Alexandre, et al.
Published: (2024)
by: Galashov, Alexandre, et al.
Published: (2024)
A Distributional Analogue to the Successor Representation
by: Wiltzer, Harley, et al.
Published: (2024)
by: Wiltzer, Harley, et al.
Published: (2024)
Conditions on Preference Relations that Guarantee the Existence of Optimal Policies
by: Carr, Jonathan Colaço, et al.
Published: (2023)
by: Carr, Jonathan Colaço, et al.
Published: (2023)
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
by: Arnob, Samin Yeasar, et al.
Published: (2025)
by: Arnob, Samin Yeasar, et al.
Published: (2025)
Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)
by: Jain, Arushi, et al.
Published: (2024)
Fluid-Agent Reinforcement Learning
by: Sharma, Shishir, et al.
Published: (2026)
by: Sharma, Shishir, et al.
Published: (2026)
Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
by: Alver, Safa, et al.
Published: (2024)
by: Alver, Safa, et al.
Published: (2024)
Lattice: Learning to Efficiently Compress the Memory
by: Karami, Mahdi, et al.
Published: (2025)
by: Karami, Mahdi, et al.
Published: (2025)
Deep Grokking: Would Deep Neural Networks Generalize Better?
by: Fan, Simin, et al.
Published: (2024)
by: Fan, Simin, et al.
Published: (2024)
Toward Human-AI Alignment in Large-Scale Multi-Player Games
by: Sharma, Sugandha, et al.
Published: (2024)
by: Sharma, Sugandha, et al.
Published: (2024)
The Limits of Predicting Agents from Behaviour
by: Bellot, Alexis, et al.
Published: (2025)
by: Bellot, Alexis, et al.
Published: (2025)
An Analysis of Quantile Temporal-Difference Learning
by: Rowland, Mark, et al.
Published: (2023)
by: Rowland, Mark, et al.
Published: (2023)
Parseval Regularization for Continual Reinforcement Learning
by: Chung, Wesley, et al.
Published: (2024)
by: Chung, Wesley, et al.
Published: (2024)
Relative Trajectory Balance is equivalent to Trust-PCL
by: Deleu, Tristan, et al.
Published: (2025)
by: Deleu, Tristan, et al.
Published: (2025)
NoProp: Training Neural Networks without Full Back-propagation or Full Forward-propagation
by: Li, Qinyu, et al.
Published: (2025)
by: Li, Qinyu, et al.
Published: (2025)
Meta-learning how to Share Credit among Macro-Actions
by: Hosu, Ionel-Alexandru, et al.
Published: (2025)
by: Hosu, Ionel-Alexandru, et al.
Published: (2025)
Similar Items
-
Plasticity as the Mirror of Empowerment
by: Abel, David, et al.
Published: (2025) -
Disentangling the Causes of Plasticity Loss in Neural Networks
by: Lyle, Clare, et al.
Published: (2024) -
Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024) -
Affordances Enable Partial World Modeling with LLMs
by: Khetarpal, Khimya, et al.
Published: (2026) -
Preventing Learning Stagnation in PPO by Scaling to 1 Million Parallel Environments
by: Beukman, Michael, et al.
Published: (2026)