Saved in:
| Main Authors: | Zhang, Yu, Yu, Rui, Yao, Zhipeng, Zhang, Wenyuan, Wang, Jun, Zhang, Liming |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.03324 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
by: Ye, Chenlu, et al.
Published: (2023)
by: Ye, Chenlu, et al.
Published: (2023)
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
by: Yu, Xudong, et al.
Published: (2024)
by: Yu, Xudong, et al.
Published: (2024)
Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning
by: Kim, Minung, et al.
Published: (2026)
by: Kim, Minung, et al.
Published: (2026)
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)
by: Chen, Xuyang, et al.
Published: (2025)
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing
by: Zhao, Xinbo, et al.
Published: (2024)
by: Zhao, Xinbo, et al.
Published: (2024)
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
by: Ma, Yunchang, et al.
Published: (2025)
by: Ma, Yunchang, et al.
Published: (2025)
Gap-Dependent Bounds for Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation
by: Zhang, Haochen, et al.
Published: (2026)
by: Zhang, Haochen, et al.
Published: (2026)
The Generalization Gap in Offline Reinforcement Learning
by: Mediratta, Ishita, et al.
Published: (2023)
by: Mediratta, Ishita, et al.
Published: (2023)
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2024)
by: Yao, Yihang, et al.
Published: (2024)
Value-Guidance MeanFlow for Offline Multi-Agent Reinforcement Learning
by: Pang, Teng, et al.
Published: (2026)
by: Pang, Teng, et al.
Published: (2026)
Diffusion Policies for Risk-Averse Behavior Modeling in Offline Reinforcement Learning
by: Chen, Xiaocong, et al.
Published: (2024)
by: Chen, Xiaocong, et al.
Published: (2024)
Bridging Dynamics Gaps via Diffusion Schrödinger Bridge for Cross-Domain Reinforcement Learning
by: Zhang, Hanping, et al.
Published: (2026)
by: Zhang, Hanping, et al.
Published: (2026)
Dynamic Momentum Recalibration in Online Gradient Learning
by: Yao, Zhipeng, et al.
Published: (2026)
by: Yao, Zhipeng, et al.
Published: (2026)
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
by: Wang, Changhong, et al.
Published: (2024)
by: Wang, Changhong, et al.
Published: (2024)
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
by: Tarasov, Denis, et al.
Published: (2024)
by: Tarasov, Denis, et al.
Published: (2024)
Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation
by: Li, Shangzhe, et al.
Published: (2026)
by: Li, Shangzhe, et al.
Published: (2026)
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning
by: Liao, Luofeng, et al.
Published: (2021)
by: Liao, Luofeng, et al.
Published: (2021)
Horizon Reduction as Information Loss in Offline Reinforcement Learning
by: Nidadala, Uday Kumar, et al.
Published: (2025)
by: Nidadala, Uday Kumar, et al.
Published: (2025)
Switching the Loss Reduces the Cost in Batch (Offline) Reinforcement Learning
by: Ayoub, Alex, et al.
Published: (2024)
by: Ayoub, Alex, et al.
Published: (2024)
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
by: Wen, Xiaoyu, et al.
Published: (2023)
by: Wen, Xiaoyu, et al.
Published: (2023)
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2024)
by: Gao, Chen-Xiao, et al.
Published: (2024)
Federated Offline Reinforcement Learning
by: Zhou, Doudou, et al.
Published: (2022)
by: Zhou, Doudou, et al.
Published: (2022)
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
by: Gao, Yunkai, et al.
Published: (2025)
by: Gao, Yunkai, et al.
Published: (2025)
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
by: Yang, Rui, et al.
Published: (2023)
by: Yang, Rui, et al.
Published: (2023)
Energy-Weighted Flow Matching for Offline Reinforcement Learning
by: Zhang, Shiyuan, et al.
Published: (2025)
by: Zhang, Shiyuan, et al.
Published: (2025)
Rethinking Optimal Transport in Offline Reinforcement Learning
by: Asadulaev, Arip, et al.
Published: (2024)
by: Asadulaev, Arip, et al.
Published: (2024)
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
by: Huang, Xiao, et al.
Published: (2025)
by: Huang, Xiao, et al.
Published: (2025)
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
by: Li, Lanqing, et al.
Published: (2024)
by: Li, Lanqing, et al.
Published: (2024)
Bridging the Gap between Learning and Inference for Diffusion-Based Molecule Generation
by: Liu, Peidong, et al.
Published: (2024)
by: Liu, Peidong, et al.
Published: (2024)
DiM-TS: Bridge the Gap between Selective State Space Models and Time Series for Generative Modeling
by: Yao, Zihao, et al.
Published: (2025)
by: Yao, Zihao, et al.
Published: (2025)
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning
by: Lin, Haoxin, et al.
Published: (2024)
by: Lin, Haoxin, et al.
Published: (2024)
Residuals-based Offline Reinforcement Learning
by: Zhu, Qing, et al.
Published: (2026)
by: Zhu, Qing, et al.
Published: (2026)
Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering
by: Qiao, Zhongjian, et al.
Published: (2025)
by: Qiao, Zhongjian, et al.
Published: (2025)
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
by: Zhang, Jing, et al.
Published: (2023)
by: Zhang, Jing, et al.
Published: (2023)
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
by: Liu, Xu-Hui, et al.
Published: (2024)
by: Liu, Xu-Hui, et al.
Published: (2024)
Augmenting Offline Reinforcement Learning with State-only Interactions
by: Li, Shangzhe, et al.
Published: (2024)
by: Li, Shangzhe, et al.
Published: (2024)
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
by: Yan, Teng, et al.
Published: (2024)
by: Yan, Teng, et al.
Published: (2024)
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
by: Wen, Xiaoyu, et al.
Published: (2024)
by: Wen, Xiaoyu, et al.
Published: (2024)
From Static Constraints to Dynamic Adaptation: Sample-Level Constraint Relaxation for Offline-to-Online Reinforcement Learning
by: Zu, Lipeng, et al.
Published: (2025)
by: Zu, Lipeng, et al.
Published: (2025)
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2025)
by: Gao, Chen-Xiao, et al.
Published: (2025)
Similar Items
-
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
by: Ye, Chenlu, et al.
Published: (2023) -
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
by: Yu, Xudong, et al.
Published: (2024) -
Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning
by: Kim, Minung, et al.
Published: (2026) -
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025) -
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing
by: Zhao, Xinbo, et al.
Published: (2024)