Saved in:
| Main Authors: | Neggatu, Natinael Solomon, Houssineau, Jeremie, Montana, Giovanni |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00629 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Evaluation-Time Policy Switching for Offline Reinforcement Learning
by: Neggatu, Natinael Solomon, et al.
Published: (2025)
by: Neggatu, Natinael Solomon, et al.
Published: (2025)
Investigating Relational State Abstraction in Collaborative MARL
by: Utke, Sharlin, et al.
Published: (2024)
by: Utke, Sharlin, et al.
Published: (2024)
Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning
by: Zhu, Ting, et al.
Published: (2024)
by: Zhu, Ting, et al.
Published: (2024)
Learning Partial Action Replacement in Offline MARL
by: Jin, Yue, et al.
Published: (2026)
by: Jin, Yue, et al.
Published: (2026)
Partial Action Replacement: Tackling Distribution Shift in Offline MARL
by: Jin, Yue, et al.
Published: (2025)
by: Jin, Yue, et al.
Published: (2025)
State-Constrained Offline Reinforcement Learning
by: Hepburn, Charles A., et al.
Published: (2024)
by: Hepburn, Charles A., et al.
Published: (2024)
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
by: Mark, Max Sobol, et al.
Published: (2024)
by: Mark, Max Sobol, et al.
Published: (2024)
SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)
by: Zhang, Liyu, et al.
Published: (2024)
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
by: Park, Seohong, et al.
Published: (2023)
by: Park, Seohong, et al.
Published: (2023)
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
by: He, Longxiang, et al.
Published: (2025)
by: He, Longxiang, et al.
Published: (2025)
Retrosynthesis Planning via Worst-path Policy Optimisation in Tree-structured MDPs
by: Wang, Mianchu, et al.
Published: (2025)
by: Wang, Mianchu, et al.
Published: (2025)
Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only
by: Xiao, Wei, et al.
Published: (2025)
by: Xiao, Wei, et al.
Published: (2025)
An Empirical Study on the Effectiveness of Incorporating Offline RL As Online RL Subroutines
by: Su, Jianhai, et al.
Published: (2025)
by: Su, Jianhai, et al.
Published: (2025)
Scalable Offline Model-Based RL with Action Chunks
by: Park, Kwanyoung, et al.
Published: (2025)
by: Park, Kwanyoung, et al.
Published: (2025)
Possibilistic Predictive Uncertainty for Deep Learning
by: Ni, Yao, et al.
Published: (2026)
by: Ni, Yao, et al.
Published: (2026)
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
by: Wang, Mianchu, et al.
Published: (2023)
by: Wang, Mianchu, et al.
Published: (2023)
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
by: Cheng, Jie, et al.
Published: (2024)
by: Cheng, Jie, et al.
Published: (2024)
REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes
by: Ireland, David, et al.
Published: (2024)
by: Ireland, David, et al.
Published: (2024)
DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
by: Kim, Changyeon, et al.
Published: (2025)
by: Kim, Changyeon, et al.
Published: (2025)
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
by: Zhu, Yujie, et al.
Published: (2025)
by: Zhu, Yujie, et al.
Published: (2025)
COOPO: Cyclic Offline-Online Policy Optimization Algorithm
by: Liu, Qisai, et al.
Published: (2026)
by: Liu, Qisai, et al.
Published: (2026)
Chain-of-Goals Hierarchical Policy for Long-Horizon Offline Goal-Conditioned RL
by: Choi, Jinwoo, et al.
Published: (2026)
by: Choi, Jinwoo, et al.
Published: (2026)
Budgeting Counterfactual for Offline RL
by: Liu, Yao, et al.
Published: (2023)
by: Liu, Yao, et al.
Published: (2023)
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)
by: Zhang, Tianle, et al.
Published: (2024)
Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies
by: Chen, Jiaqi, et al.
Published: (2025)
by: Chen, Jiaqi, et al.
Published: (2025)
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
by: Nakamoto, Mitsuhiko, et al.
Published: (2023)
by: Nakamoto, Mitsuhiko, et al.
Published: (2023)
Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing
by: Jin, Yue, et al.
Published: (2024)
by: Jin, Yue, et al.
Published: (2024)
From Offline to Online Memory-Free and Task-Free Continual Learning via Fine-Grained Hypergradients
by: Michel, Nicolas, et al.
Published: (2025)
by: Michel, Nicolas, et al.
Published: (2025)
Advancing Safe Mechanical Ventilation Using Offline RL With Hybrid Actions and Clinically Aligned Rewards
by: Yousuf, Muhammad Hamza, et al.
Published: (2025)
by: Yousuf, Muhammad Hamza, et al.
Published: (2025)
Soft Policy Optimization: Online Off-Policy RL for Sequence Models
by: Cohen, Taco, et al.
Published: (2025)
by: Cohen, Taco, et al.
Published: (2025)
SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
by: Kiyohara, Haruka, et al.
Published: (2023)
by: Kiyohara, Haruka, et al.
Published: (2023)
Offline RLAIF: Piloting VLM Feedback for RL via SFO
by: Beck, Jacob
Published: (2025)
by: Beck, Jacob
Published: (2025)
Offline RL for Adaptive Policy Retrieval in Prior Authorization
by: Sharifullin, Ruslan, et al.
Published: (2026)
by: Sharifullin, Ruslan, et al.
Published: (2026)
Selective Uncertainty Propagation in Offline RL
by: Krishnamurthy, Sanath Kumar, et al.
Published: (2023)
by: Krishnamurthy, Sanath Kumar, et al.
Published: (2023)
Decoupled Prioritized Resampling for Offline RL
by: Yue, Yang, et al.
Published: (2023)
by: Yue, Yang, et al.
Published: (2023)
Augmenting Offline RL with Unlabeled Data
by: Wang, Zhao, et al.
Published: (2024)
by: Wang, Zhao, et al.
Published: (2024)
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2024)
by: Alles, Marvin, et al.
Published: (2024)
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
by: Huang, Xiao, et al.
Published: (2025)
by: Huang, Xiao, et al.
Published: (2025)
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
by: Niu, Haoyi, et al.
Published: (2023)
by: Niu, Haoyi, et al.
Published: (2023)
Residual Q-Learning: Offline and Online Policy Customization without Value
by: Li, Chenran, et al.
Published: (2023)
by: Li, Chenran, et al.
Published: (2023)
Similar Items
-
Evaluation-Time Policy Switching for Offline Reinforcement Learning
by: Neggatu, Natinael Solomon, et al.
Published: (2025) -
Investigating Relational State Abstraction in Collaborative MARL
by: Utke, Sharlin, et al.
Published: (2024) -
Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning
by: Zhu, Ting, et al.
Published: (2024) -
Learning Partial Action Replacement in Offline MARL
by: Jin, Yue, et al.
Published: (2026) -
Partial Action Replacement: Tackling Distribution Shift in Offline MARL
by: Jin, Yue, et al.
Published: (2025)