| Main Authors: | Farebrother, Jesse, Castro, Pablo Samuel |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2410.23810 |
Similar Items
Mixtures of Experts Unlock Parameter Scaling for Deep RL
by: Obando-Ceron, Johan, et al.
Published: (2024)
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
by: Jain, Arnav Kumar, et al.
Published: (2024)
Temporal Difference Flows
by: Farebrother, Jesse, et al.
Published: (2025)
Compositional Planning with Jumpy World Models
by: Farebrother, Jesse, et al.
Published: (2026)
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
by: Farebrother, Jesse, et al.
Published: (2024)
The Formalism-Implementation Gap in Reinforcement Learning Research
by: Castro, Pablo Samuel
Published: (2025)
Investigating Memory in Model-Free RL with POPGym Arcade
by: Wang, Zekang, et al.
Published: (2025)
A Distributional Analogue to the Successor Representation
by: Wiltzer, Harley, et al.
Published: (2024)
A Survey of State Representation Learning for Deep Reinforcement Learning
by: Echchahed, Ayoub, et al.
Published: (2025)
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
by: Tang, Hongyao, et al.
Published: (2025)
Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning
by: Sokar, Ghada, et al.
Published: (2025)
NAVIX: Scaling MiniGrid Environments with JAX
by: Pignatelli, Eduardo, et al.
Published: (2024)
The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks
by: Mayor, Walter, et al.
Published: (2025)
In value-based deep reinforcement learning, a pruned network is a good network
by: Obando-Ceron, Johan, et al.
Published: (2024)
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
by: Castanyer, Roger Creus, et al.
Published: (2025)
Continual Learning in Vision-Language Models via Aligned Model Merging
by: Sokar, Ghada, et al.
Published: (2025)
Stable Deep Reinforcement Learning via Isotropic Gaussian Representations
by: Pasand, Ali Saheb, et al.
Published: (2026)
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
by: Garcin, Samuel, et al.
Published: (2025)
CLeAN: Continual Learning Adaptive Normalization in Dynamic Environments
by: Marasco, Isabella, et al.
Published: (2026)
Multi-Task Reinforcement Learning Enables Parameter Scaling
by: McLean, Reginald, et al.
Published: (2025)
HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning
by: Delfosse, Quentin, et al.
Published: (2024)
Continual Learning for Adaptable Car-Following in Dynamic Traffic Environments
by: Chen, Xianda, et al.
Published: (2024)
Natural Mitigation of Catastrophic Interference: Continual Learning in Power-Law Learning Environments
by: Gandhi, Atith, et al.
Published: (2024)
Explorative Imitation Learning: A Path Signature Approach for Continuous Environments
by: Gavenski, Nathan, et al.
Published: (2024)
On Sequential Bayesian Inference for Continual Learning
by: Kessler, Samuel, et al.
Published: (2023)
Continual Reinforcement Learning via Autoencoder-Driven Task and New Environment Recognition
by: Erden, Zeki Doruk, et al.
Published: (2025)
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
by: Obando-Ceron, Johan, et al.
Published: (2024)
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
by: Sokar, Ghada, et al.
Published: (2024)
Mixture of Experts in a Mixture of RL settings
by: Willi, Timon, et al.
Published: (2024)
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
by: Obando-Ceron, Johan, et al.
Published: (2025)
Action Mapping for Reinforcement Learning in Continuous Environments with Constraints
by: Theile, Mirco, et al.
Published: (2024)
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
by: Garcin, Samuel, et al.
Published: (2024)
Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling
by: van Remmerden, Jesse, et al.
Published: (2024)
Foundations of Multivariate Distributional Reinforcement Learning
by: Wiltzer, Harley, et al.
Published: (2024)
CRAFT: Forgetting-Aware Intervention-Based Adaptation for Continual Learning
by: Hossen, Md Anwar, et al.
Published: (2026)
Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Data
by: van Remmerden, Jesse, et al.
Published: (2025)
Learning to Compile Programs to Neural Networks
by: Weber, Logan, et al.
Published: (2024)
MADQRL: Distributed Quantum Reinforcement Learning Framework for Multi-Agent Environments
by: Sawaika, Abhishek, et al.
Published: (2026)
STEP: Learning STructured Embeddings for Progressive Time Series
by: Thil, Lucas, et al.
Published: (2026)
ED2: Environment Dynamics Decomposition World Models for Continuous Control
by: Hao, Jianye, et al.
Published: (2021)