Saved in:
| Main Authors: | Chaudhary, Gaurav, Mondal, Wassim Uddin, Behera, Laxmidhar |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.09574 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning
by: Chaudhary, Gaurav, et al.
Published: (2025)
by: Chaudhary, Gaurav, et al.
Published: (2025)
Match or Replay: Self Imitating Proximal Policy Optimization
by: Chaudhary, Gaurav, et al.
Published: (2026)
by: Chaudhary, Gaurav, et al.
Published: (2026)
TEACH: Temporal Variance-Driven Curriculum for Reinforcement Learning
by: Chaudhary, Gaurav, et al.
Published: (2025)
by: Chaudhary, Gaurav, et al.
Published: (2025)
Sample-Efficient Constrained Reinforcement Learning with General Parameterization
by: Mondal, Washim Uddin, et al.
Published: (2024)
by: Mondal, Washim Uddin, et al.
Published: (2024)
Online Pre-Training for Offline-to-Online Reinforcement Learning
by: Shin, Yongjae, et al.
Published: (2025)
by: Shin, Yongjae, et al.
Published: (2025)
Efficient and Uncertainty-Aware Diffusion Framework for Offline-to-Online Reinforcement Learning
by: Bui, Ha Manh, et al.
Published: (2026)
by: Bui, Ha Manh, et al.
Published: (2026)
A Sharper Global Convergence Analysis for Average Reward Reinforcement Learning via an Actor-Critic Approach
by: Ganesh, Swetha, et al.
Published: (2024)
by: Ganesh, Swetha, et al.
Published: (2024)
Finite-Sample Analysis of Policy Evaluation for Robust Average Reward Reinforcement Learning
by: Xu, Yang, et al.
Published: (2025)
by: Xu, Yang, et al.
Published: (2025)
Information-Directed Offline-to-Online Reinforcement Learning
by: Chen, Keru
Published: (2026)
by: Chen, Keru
Published: (2026)
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
by: Guo, Siyuan, et al.
Published: (2023)
by: Guo, Siyuan, et al.
Published: (2023)
Dynamic Hand Gesture Recognition for Robot Manipulator Tasks
by: Sharma, Dharmendra, et al.
Published: (2026)
by: Sharma, Dharmendra, et al.
Published: (2026)
Bayesian Design Principles for Offline-to-Online Reinforcement Learning
by: Hu, Hao, et al.
Published: (2024)
by: Hu, Hao, et al.
Published: (2024)
Online Optimization for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2025)
by: Chemingui, Yassine, et al.
Published: (2025)
The Three Regimes of Offline-to-Online Reinforcement Learning
by: Li, Lu, et al.
Published: (2025)
by: Li, Lu, et al.
Published: (2025)
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms
by: Aggarwal, Vaneet, et al.
Published: (2024)
by: Aggarwal, Vaneet, et al.
Published: (2024)
Governance-as-a-Service: A Multi-Agent Framework for AI System Compliance and Policy Enforcement
by: Gaurav, Suyash, et al.
Published: (2025)
by: Gaurav, Suyash, et al.
Published: (2025)
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning
by: Wang, Qi, et al.
Published: (2023)
by: Wang, Qi, et al.
Published: (2023)
Adaptive Q-Chunking for Offline-to-Online Reinforcement Learning
by: Gireesh, Nandiraju, et al.
Published: (2026)
by: Gireesh, Nandiraju, et al.
Published: (2026)
Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
by: Mondal, Washim Uddin, et al.
Published: (2022)
by: Mondal, Washim Uddin, et al.
Published: (2022)
A Non-Monolithic Policy Approach of Offline-to-Online Reinforcement Learning
by: Kim, JaeYoon, et al.
Published: (2024)
by: Kim, JaeYoon, et al.
Published: (2024)
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
by: Liu, Shirong, et al.
Published: (2024)
by: Liu, Shirong, et al.
Published: (2024)
Active Advantage-Aligned Online Reinforcement Learning with Offline Data
by: Liu, Xuefeng, et al.
Published: (2025)
by: Liu, Xuefeng, et al.
Published: (2025)
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
by: Liu, Xu-Hui, et al.
Published: (2024)
by: Liu, Xu-Hui, et al.
Published: (2024)
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
by: McInroe, Trevor, et al.
Published: (2023)
by: McInroe, Trevor, et al.
Published: (2023)
Adaptive Replay Buffer for Offline-to-Online Reinforcement Learning
by: Song, Chihyeon, et al.
Published: (2025)
by: Song, Chihyeon, et al.
Published: (2025)
Offline-Online Reinforcement Learning for Linear Mixture MDPs
by: Zhang, Zhongjun, et al.
Published: (2026)
by: Zhang, Zhongjun, et al.
Published: (2026)
Discrete Flow Matching for Offline-to-Online Reinforcement Learning
by: Khan, Fairoz Nower, et al.
Published: (2026)
by: Khan, Fairoz Nower, et al.
Published: (2026)
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning
by: Nakhaei, Mohammadreza, et al.
Published: (2024)
by: Nakhaei, Mohammadreza, et al.
Published: (2024)
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
by: Wen, Xiaoyu, et al.
Published: (2023)
by: Wen, Xiaoyu, et al.
Published: (2023)
Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data
by: Deng, Shilong, et al.
Published: (2025)
by: Deng, Shilong, et al.
Published: (2025)
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
by: Shen, Yi, et al.
Published: (2024)
by: Shen, Yi, et al.
Published: (2024)
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
by: Huang, Xiao, et al.
Published: (2025)
by: Huang, Xiao, et al.
Published: (2025)
RLSynC: Offline-Online Reinforcement Learning for Synthon Completion
by: Baker, Frazier N., et al.
Published: (2023)
by: Baker, Frazier N., et al.
Published: (2023)
Flow Matching with Injected Noise for Offline-to-Online Reinforcement Learning
by: Shin, Yongjae, et al.
Published: (2026)
by: Shin, Yongjae, et al.
Published: (2026)
Online Symbolic Music Alignment with Offline Reinforcement Learning
by: Peter, Silvan David
Published: (2023)
by: Peter, Silvan David
Published: (2023)
SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)
by: Zhang, Liyu, et al.
Published: (2024)
Stable Online and Offline Reinforcement Learning for Antibody CDRH3 Design
by: Vogt, Yannick, et al.
Published: (2023)
by: Vogt, Yannick, et al.
Published: (2023)
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
by: Zhao, Kai, et al.
Published: (2023)
by: Zhao, Kai, et al.
Published: (2023)
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data
by: Zhou, Zhiyuan, et al.
Published: (2024)
by: Zhou, Zhiyuan, et al.
Published: (2024)
Behavior-Adaptive Q-Learning: A Unifying Framework for Offline-to-Online RL
by: Zu, Lipeng, et al.
Published: (2025)
by: Zu, Lipeng, et al.
Published: (2025)
Similar Items
-
From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning
by: Chaudhary, Gaurav, et al.
Published: (2025) -
Match or Replay: Self Imitating Proximal Policy Optimization
by: Chaudhary, Gaurav, et al.
Published: (2026) -
TEACH: Temporal Variance-Driven Curriculum for Reinforcement Learning
by: Chaudhary, Gaurav, et al.
Published: (2025) -
Sample-Efficient Constrained Reinforcement Learning with General Parameterization
by: Mondal, Washim Uddin, et al.
Published: (2024) -
Online Pre-Training for Offline-to-Online Reinforcement Learning
by: Shin, Yongjae, et al.
Published: (2025)