Library Catalog

Bibliographic Details
Main Authors:	Chen, Xuyang; Yan, Keyu; Cao, Wenhan; Zhao, Lin
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2505.05126

Similar Items

VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)

Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
by: Mao, Yixiu, et al.
Published: (2024)

One-Step Sampler for Boltzmann Distributions via Drifting
by: Cao, Wenhan, et al.
Published: (2026)

Variational OOD State Correction for Offline Reinforcement Learning
by: Jiang, Ke, et al.
Published: (2025)

Active Advantage-Aligned Online Reinforcement Learning with Offline Data
by: Liu, Xuefeng, et al.
Published: (2025)

An Advantage-based Optimization Method for Reinforcement Learning in Large Action Space
by: Lin, Hai, et al.
Published: (2024)

Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)

SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
by: Li, Xuyang, et al.
Published: (2025)

Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
by: Liu, Tenglong, et al.
Published: (2024)

Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control
by: Cao, Wenhan, et al.
Published: (2024)

Global Optimality of Single-Timescale Actor-Critic under Continuous State-Action Space: A Study on Linear Quadratic Regulator
by: Chen, Xuyang, et al.
Published: (2025)

Flow Matching for Offline Reinforcement Learning with Discrete Actions
by: Khan, Fairoz Nower, et al.
Published: (2026)

An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces
by: Beeson, Alex, et al.
Published: (2024)

FAWAC: Feasibility Informed Advantage Weighted Regression for Persistent Safety in Offline Reinforcement Learning
by: Koirala, Prajwal, et al.
Published: (2024)

Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2024)

Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions
by: Xu, Yinglun, et al.
Published: (2023)

A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
by: Qing, Yunpeng, et al.
Published: (2024)

M3OOD: Automatic Selection of Multimodal OOD Detectors
by: Qin, Yuehan, et al.
Published: (2025)

Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
by: Wiltzer, Harley, et al.
Published: (2024)

Offline Reinforcement Learning with Penalized Action Noise Injection
by: Oh, JunHyeok, et al.
Published: (2025)

Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)

Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood
by: Yao, Qingmao, et al.
Published: (2025)

Mutual Information Regularized Offline Reinforcement Learning
by: Ma, Xiao, et al.
Published: (2022)

CO-RFT: Efficient Fine-Tuning of Vision-Language-Action Models through Chunked Offline Reinforcement Learning
by: Huang, Dongchi, et al.
Published: (2025)

BraVE: Offline Reinforcement Learning for Discrete Combinatorial Action Spaces
by: Landers, Matthew, et al.
Published: (2024)

ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning
by: Ren, Qingnan, et al.
Published: (2026)

SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)

Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
by: Cao, Hongye, et al.
Published: (2025)

Proximal Action Replacement for Behavior Cloning Actor-Critic in Offline Reinforcement Learning
by: Dong, Jinzong, et al.
Published: (2026)

Improved AdaBoost for Virtual Reality Experience Prediction Based on Long Short-Term Memory Network
by: Fan, Wenhan, et al.
Published: (2024)

Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data
by: Kim, Jeonghye, et al.
Published: (2025)

Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
by: Pan, Minting, et al.
Published: (2025)

An Information-Theoretic Analysis of OOD Generalization in Meta-Reinforcement Learning
by: Liu, Xingtu
Published: (2025)

DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
by: Li, Zongyue, et al.
Published: (2025)

Information-Directed Offline-to-Online Reinforcement Learning
by: Chen, Keru
Published: (2026)

Finite-time analysis of single-timescale actor-critic
by: Chen, Xuyang, et al.
Published: (2022)

Robustness Evaluation of Offline Reinforcement Learning for Robot Control Against Action Perturbations
by: Ayabe, Shingo, et al.
Published: (2024)

MetaOOD: Automatic Selection of OOD Detection Models
by: Qin, Yuehan, et al.
Published: (2024)

OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2024)

Advantage-Guided Diffusion for Model-Based Reinforcement Learning
by: Foffano, Daniele, et al.
Published: (2026)