| Main Authors: | Klissarov, Martin; Bagaria, Akhil; Luo, Ziyan; Konidaris, George; Precup, Doina; Machado, Marlos C. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.14045 |
Similar Items
Understanding Behavioral Metric Learning: A Large-Scale Study on Distracting Reinforcement Learning Environments
by: Luo, Ziyan, et al.
Published: (2025)
MaestroMotif: Skill Design from Artificial Intelligence Feedback
by: Klissarov, Martin, et al.
Published: (2024)
Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations
by: Zhang, Shuyuan, et al.
Published: (2025)
Fluid-Agent Reinforcement Learning
by: Sharma, Shishir, et al.
Published: (2026)
Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
by: Alver, Safa, et al.
Published: (2024)
Parseval Regularization for Continual Reinforcement Learning
by: Chung, Wesley, et al.
Published: (2024)
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
by: Arnob, Samin Yeasar, et al.
Published: (2025)
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
by: Zhao, Mingde, et al.
Published: (2023)
Diversity-Enriched Option-Critic
by: Kamat, Anand, et al.
Published: (2020)
Functional Acceleration for Policy Mirror Descent
by: Chelu, Veronica, et al.
Published: (2024)
A Look at Value-Based Decision-Time vs. Background Planning Methods Across Different Settings
by: Alver, Safa, et al.
Published: (2022)
Demystifying the Recency Heuristic in Temporal-Difference Learning
by: Daley, Brett, et al.
Published: (2024)
Harnessing Discrete Representations For Continual Reinforcement Learning
by: Meyer, Edan, et al.
Published: (2023)
Reinforcement Learning with Pairwise Preferences in Long-Term Decision Problems
by: Carr, Jonathan Colaço, et al.
Published: (2026)
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
by: Cherif, Lynn, et al.
Published: (2025)
An Analysis of Action-Value Temporal-Difference Methods That Learn State Values
by: Daley, Brett, et al.
Published: (2025)
On the Geometry of Reinforcement Learning in Continuous State and Action Spaces
by: Tiwari, Saket, et al.
Published: (2022)
Geometry of Neural Reinforcement Learning in Continuous State and Action Spaces
by: Tiwari, Saket, et al.
Published: (2025)
Code as Reward: Empowering Reinforcement Learning with VLMs
by: Venuto, David, et al.
Published: (2024)
AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning
by: Pramanik, Subhojeet, et al.
Published: (2023)
From Pixels to Factors: Learning Independently Controllable State Variables for Reinforcement Learning
by: Rodriguez-Sanchez, Rafael, et al.
Published: (2025)
Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)
Learning Markov State Abstractions for Deep Reinforcement Learning
by: Allen, Cameron, et al.
Published: (2021)
The Laplacian Keyboard: Beyond the Linear Span
by: Chandrasekar, Siddarth, et al.
Published: (2026)
Proper Laplacian Representation Learning
by: Gomez, Diego, et al.
Published: (2023)
Knowledge Retention for Continual Model-Based Reinforcement Learning
by: Sun, Yixiang, et al.
Published: (2025)
Model-based Reinforcement Learning for Parameterized Action Spaces
by: Zhang, Renhao, et al.
Published: (2024)
SCAR: Shapley Credit Assignment for More Efficient RLHF
by: Cao, Meng, et al.
Published: (2025)
Capacity-Constrained Continual Learning
by: Wen, Zheng, et al.
Published: (2025)
Deep Reinforcement Learning with Gradient Eligibility Traces
by: Elelimy, Esraa, et al.
Published: (2025)
Learning Abstract World Model for Value-preserving Planning with Options
by: Rodriguez-Sanchez, Rafael, et al.
Published: (2024)
Rejecting Hallucinated State Targets during Planning
by: Zhao, Mingde, et al.
Published: (2024)
Benchmarking Partial Observability in Reinforcement Learning with a Suite of Memory-Improvable Domains
by: Tao, Ruo Yu, et al.
Published: (2025)
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks
by: Quartey, Benedict, et al.
Published: (2023)
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling
by: Ishfaq, Haque, et al.
Published: (2024)
Deep Double Q-learning
by: Nagarajan, Prabhat, et al.
Published: (2025)
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
by: Patil, Gandharv, et al.
Published: (2022)
Learning to Learn from Language Feedback with Social Meta-Learning
by: Cook, Jonathan, et al.
Published: (2026)
The Cell Must Go On: Agar.io for Continual Reinforcement Learning
by: Mohamed, Mohamed A., et al.
Published: (2025)
Automating Curriculum Learning for Reinforcement Learning using a Skill-Based Bayesian Network
by: Hsiao, Vincent, et al.
Published: (2025)