Saved in:
| Main Authors: | Chen, Xuyang, Yan, Keyu, Cao, Wenhan, Zhao, Lin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.05126 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)
by: Chen, Xuyang, et al.
Published: (2025)
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
by: Mao, Yixiu, et al.
Published: (2024)
by: Mao, Yixiu, et al.
Published: (2024)
One-Step Sampler for Boltzmann Distributions via Drifting
by: Cao, Wenhan, et al.
Published: (2026)
by: Cao, Wenhan, et al.
Published: (2026)
Variational OOD State Correction for Offline Reinforcement Learning
by: Jiang, Ke, et al.
Published: (2025)
by: Jiang, Ke, et al.
Published: (2025)
Active Advantage-Aligned Online Reinforcement Learning with Offline Data
by: Liu, Xuefeng, et al.
Published: (2025)
by: Liu, Xuefeng, et al.
Published: (2025)
An Advantage-based Optimization Method for Reinforcement Learning in Large Action Space
by: Lin, Hai, et al.
Published: (2024)
by: Lin, Hai, et al.
Published: (2024)
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)
by: Zhang, Tianle, et al.
Published: (2024)
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
by: Li, Xuyang, et al.
Published: (2025)
by: Li, Xuyang, et al.
Published: (2025)
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
by: Liu, Tenglong, et al.
Published: (2024)
by: Liu, Tenglong, et al.
Published: (2024)
Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control
by: Cao, Wenhan, et al.
Published: (2024)
by: Cao, Wenhan, et al.
Published: (2024)
Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
by: Chen, Xuyang, et al.
Published: (2025)
by: Chen, Xuyang, et al.
Published: (2025)
Flow Matching for Offline Reinforcement Learning with Discrete Actions
by: Khan, Fairoz Nower, et al.
Published: (2026)
by: Khan, Fairoz Nower, et al.
Published: (2026)
An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces
by: Beeson, Alex, et al.
Published: (2024)
by: Beeson, Alex, et al.
Published: (2024)
FAWAC: Feasibility Informed Advantage Weighted Regression for Persistent Safety in Offline Reinforcement Learning
by: Koirala, Prajwal, et al.
Published: (2024)
by: Koirala, Prajwal, et al.
Published: (2024)
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2024)
by: Alles, Marvin, et al.
Published: (2024)
Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions
by: Xu, Yinglun, et al.
Published: (2023)
by: Xu, Yinglun, et al.
Published: (2023)
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
by: Qing, Yunpeng, et al.
Published: (2024)
by: Qing, Yunpeng, et al.
Published: (2024)
M3OOD: Automatic Selection of Multimodal OOD Detectors
by: Qin, Yuehan, et al.
Published: (2025)
by: Qin, Yuehan, et al.
Published: (2025)
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
by: Wiltzer, Harley, et al.
Published: (2024)
by: Wiltzer, Harley, et al.
Published: (2024)
Offline Reinforcement Learning with Penalized Action Noise Injection
by: Oh, JunHyeok, et al.
Published: (2025)
by: Oh, JunHyeok, et al.
Published: (2025)
Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)
by: Zhao, Ziqi, et al.
Published: (2024)
Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood
by: Yao, Qingmao, et al.
Published: (2025)
by: Yao, Qingmao, et al.
Published: (2025)
Mutual Information Regularized Offline Reinforcement Learning
by: Ma, Xiao, et al.
Published: (2022)
by: Ma, Xiao, et al.
Published: (2022)
CO-RFT: Efficient Fine-Tuning of Vision-Language-Action Models through Chunked Offline Reinforcement Learning
by: Huang, Dongchi, et al.
Published: (2025)
by: Huang, Dongchi, et al.
Published: (2025)
BraVE: Offline Reinforcement Learning for Discrete Combinatorial Action Spaces
by: Landers, Matthew, et al.
Published: (2024)
by: Landers, Matthew, et al.
Published: (2024)
ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning
by: Ren, Qingnan, et al.
Published: (2026)
by: Ren, Qingnan, et al.
Published: (2026)
SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)
by: Zhang, Liyu, et al.
Published: (2024)
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
by: Cao, Hongye, et al.
Published: (2025)
by: Cao, Hongye, et al.
Published: (2025)
Proximal Action Replacement for Behavior Cloning Actor-Critic in Offline Reinforcement Learning
by: Dong, Jinzong, et al.
Published: (2026)
by: Dong, Jinzong, et al.
Published: (2026)
Improved AdaBoost for Virtual Reality Experience Prediction Based on Long Short-Term Memory Network
by: Fan, Wenhan, et al.
Published: (2024)
by: Fan, Wenhan, et al.
Published: (2024)
Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data
by: Kim, Jeonghye, et al.
Published: (2025)
by: Kim, Jeonghye, et al.
Published: (2025)
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
by: Pan, Minting, et al.
Published: (2025)
by: Pan, Minting, et al.
Published: (2025)
An Information-Theoretic Analysis of OOD Generalization in Meta-Reinforcement Learning
by: Liu, Xingtu
Published: (2025)
by: Liu, Xingtu
Published: (2025)
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
by: Li, Zongyue, et al.
Published: (2025)
by: Li, Zongyue, et al.
Published: (2025)
Information-Directed Offline-to-Online Reinforcement Learning
by: Chen, Keru
Published: (2026)
by: Chen, Keru
Published: (2026)
Finite-time analysis of single-timescale actor-critic
by: Chen, Xuyang, et al.
Published: (2022)
by: Chen, Xuyang, et al.
Published: (2022)
Robustness Evaluation of Offline Reinforcement Learning for Robot Control Against Action Perturbations
by: Ayabe, Shingo, et al.
Published: (2024)
by: Ayabe, Shingo, et al.
Published: (2024)
MetaOOD: Automatic Selection of OOD Detection Models
by: Qin, Yuehan, et al.
Published: (2024)
by: Qin, Yuehan, et al.
Published: (2024)
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2024)
by: Yao, Yihang, et al.
Published: (2024)
Advantage-Guided Diffusion for Model-Based Reinforcement Learning
by: Foffano, Daniele, et al.
Published: (2026)
by: Foffano, Daniele, et al.
Published: (2026)
Similar Items
-
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025) -
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
by: Mao, Yixiu, et al.
Published: (2024) -
One-Step Sampler for Boltzmann Distributions via Drifting
by: Cao, Wenhan, et al.
Published: (2026) -
Variational OOD State Correction for Offline Reinforcement Learning
by: Jiang, Ke, et al.
Published: (2025) -
Active Advantage-Aligned Online Reinforcement Learning with Offline Data
by: Liu, Xuefeng, et al.
Published: (2025)