:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Neggatu, Natinael Solomon, Houssineau, Jeremie, Montana, Giovanni
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.00629
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Evaluation-Time Policy Switching for Offline Reinforcement Learning
by: Neggatu, Natinael Solomon, et al.
Published: (2025)

Investigating Relational State Abstraction in Collaborative MARL
by: Utke, Sharlin, et al.
Published: (2024)

Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning
by: Zhu, Ting, et al.
Published: (2024)

Learning Partial Action Replacement in Offline MARL
by: Jin, Yue, et al.
Published: (2026)

Partial Action Replacement: Tackling Distribution Shift in Offline MARL
by: Jin, Yue, et al.
Published: (2025)

State-Constrained Offline Reinforcement Learning
by: Hepburn, Charles A., et al.
Published: (2024)

Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
by: Mark, Max Sobol, et al.
Published: (2024)

SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)

HIQL: Offline Goal-Conditioned RL with Latent States as Actions
by: Park, Seohong, et al.
Published: (2023)

Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
by: He, Longxiang, et al.
Published: (2025)

Retrosynthesis Planning via Worst-path Policy Optimisation in Tree-structured MDPs
by: Wang, Mianchu, et al.
Published: (2025)

Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only
by: Xiao, Wei, et al.
Published: (2025)

An Empirical Study on the Effectiveness of Incorporating Offline RL As Online RL Subroutines
by: Su, Jianhai, et al.
Published: (2025)

Scalable Offline Model-Based RL with Action Chunks
by: Park, Kwanyoung, et al.
Published: (2025)

Possibilistic Predictive Uncertainty for Deep Learning
by: Ni, Yao, et al.
Published: (2026)

GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
by: Wang, Mianchu, et al.
Published: (2023)

Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
by: Cheng, Jie, et al.
Published: (2024)

REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes
by: Ireland, David, et al.
Published: (2024)

DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
by: Kim, Changyeon, et al.
Published: (2025)

Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
by: Zhu, Yujie, et al.
Published: (2025)

COOPO: Cyclic Offline-Online Policy Optimization Algorithm
by: Liu, Qisai, et al.
Published: (2026)

Chain-of-Goals Hierarchical Policy for Long-Horizon Offline Goal-Conditioned RL
by: Choi, Jinwoo, et al.
Published: (2026)

Budgeting Counterfactual for Offline RL
by: Liu, Yao, et al.
Published: (2023)

Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)

Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies
by: Chen, Jiaqi, et al.
Published: (2025)

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
by: Nakamoto, Mitsuhiko, et al.
Published: (2023)

Achieving Collective Welfare in Multi-Agent Reinforcement Learning via Suggestion Sharing
by: Jin, Yue, et al.
Published: (2024)

From Offline to Online Memory-Free and Task-Free Continual Learning via Fine-Grained Hypergradients
by: Michel, Nicolas, et al.
Published: (2025)

Advancing Safe Mechanical Ventilation Using Offline RL With Hybrid Actions and Clinically Aligned Rewards
by: Yousuf, Muhammad Hamza, et al.
Published: (2025)

Soft Policy Optimization: Online Off-Policy RL for Sequence Models
by: Cohen, Taco, et al.
Published: (2025)

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
by: Kiyohara, Haruka, et al.
Published: (2023)

Offline RLAIF: Piloting VLM Feedback for RL via SFO
by: Beck, Jacob
Published: (2025)

Offline RL for Adaptive Policy Retrieval in Prior Authorization
by: Sharifullin, Ruslan, et al.
Published: (2026)

Selective Uncertainty Propagation in Offline RL
by: Krishnamurthy, Sanath Kumar, et al.
Published: (2023)

Decoupled Prioritized Resampling for Offline RL
by: Yue, Yang, et al.
Published: (2023)

Augmenting Offline RL with Unlabeled Data
by: Wang, Zhao, et al.
Published: (2024)

Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2024)

Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
by: Huang, Xiao, et al.
Published: (2025)

H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
by: Niu, Haoyi, et al.
Published: (2023)

Residual Q-Learning: Offline and Online Policy Customization without Value
by: Li, Chenran, et al.
Published: (2023)