Saved in:
| Main Authors: | Zhang, Jing, Zhang, Chi, Wang, Wenjia, Jing, Bing-Yi |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2301.12130 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
by: Fang, Linjiajie, et al.
Published: (2024)
by: Fang, Linjiajie, et al.
Published: (2024)
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2025)
by: Gao, Chen-Xiao, et al.
Published: (2025)
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning
by: Xu, Linjie, et al.
Published: (2023)
by: Xu, Linjie, et al.
Published: (2023)
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
by: Zhang, Jing, et al.
Published: (2024)
by: Zhang, Jing, et al.
Published: (2024)
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
by: He, Longxiang, et al.
Published: (2023)
by: He, Longxiang, et al.
Published: (2023)
CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation
by: Zhang, Zhengzhe, et al.
Published: (2025)
by: Zhang, Zhengzhe, et al.
Published: (2025)
Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
by: Jing, Tan, et al.
Published: (2025)
by: Jing, Tan, et al.
Published: (2025)
Diffusion Policies for Risk-Averse Behavior Modeling in Offline Reinforcement Learning
by: Chen, Xiaocong, et al.
Published: (2024)
by: Chen, Xiaocong, et al.
Published: (2024)
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
by: Ma, Yunchang, et al.
Published: (2025)
by: Ma, Yunchang, et al.
Published: (2025)
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning
by: Koirala, Prajwal, et al.
Published: (2024)
by: Koirala, Prajwal, et al.
Published: (2024)
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
by: Shen, Yi, et al.
Published: (2024)
by: Shen, Yi, et al.
Published: (2024)
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
by: Hu, Jifeng, et al.
Published: (2025)
by: Hu, Jifeng, et al.
Published: (2025)
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)
by: Zhang, Tianle, et al.
Published: (2024)
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2024)
by: Alles, Marvin, et al.
Published: (2024)
State-Constrained Offline Reinforcement Learning
by: Hepburn, Charles A., et al.
Published: (2024)
by: Hepburn, Charles A., et al.
Published: (2024)
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices
by: Woo, Jiin, et al.
Published: (2024)
by: Woo, Jiin, et al.
Published: (2024)
Robust Offline Reinforcement Learning for Non-Markovian Decision Processes
by: Huang, Ruiquan, et al.
Published: (2024)
by: Huang, Ruiquan, et al.
Published: (2024)
Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization
by: Yuan, Haochen, et al.
Published: (2025)
by: Yuan, Haochen, et al.
Published: (2025)
Efficient Policy Evaluation with Offline Data Informed Behavior Policy Design
by: Liu, Shuze, et al.
Published: (2023)
by: Liu, Shuze, et al.
Published: (2023)
Belief-Based Offline Reinforcement Learning for Delay-Robust Policy Optimization
by: Zhan, Simon Sinong, et al.
Published: (2025)
by: Zhan, Simon Sinong, et al.
Published: (2025)
Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning
by: Khattar, Vanshaj, et al.
Published: (2024)
by: Khattar, Vanshaj, et al.
Published: (2024)
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
by: Kang, Hyungkyu, et al.
Published: (2025)
by: Kang, Hyungkyu, et al.
Published: (2025)
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
by: Mao, Yixiu, et al.
Published: (2025)
by: Mao, Yixiu, et al.
Published: (2025)
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?
by: Dai, Yang, et al.
Published: (2024)
by: Dai, Yang, et al.
Published: (2024)
Semi-gradient DICE for Offline Constrained Reinforcement Learning
by: Kim, Woosung, et al.
Published: (2025)
by: Kim, Woosung, et al.
Published: (2025)
Manifold-Constrained Energy-Based Transition Models for Offline Reinforcement Learning
by: Fang, Zeyu, et al.
Published: (2026)
by: Fang, Zeyu, et al.
Published: (2026)
Offline Reinforcement Learning with Generative Trajectory Policies
by: Feng, Xinsong, et al.
Published: (2025)
by: Feng, Xinsong, et al.
Published: (2025)
Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)
by: Zhao, Ziqi, et al.
Published: (2024)
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
by: Gao, Yunkai, et al.
Published: (2025)
by: Gao, Yunkai, et al.
Published: (2025)
LLM-Driven Policy Diffusion: Enhancing Generalization in Offline Reinforcement Learning
by: Zhang, Hanping, et al.
Published: (2025)
by: Zhang, Hanping, et al.
Published: (2025)
Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning
by: Mu, Zhancun, et al.
Published: (2026)
by: Mu, Zhancun, et al.
Published: (2026)
Long-Horizon Model-Based Offline Reinforcement Learning Without Explicit Conservatism
by: Ni, Tianwei, et al.
Published: (2025)
by: Ni, Tianwei, et al.
Published: (2025)
Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions
by: Xu, Yinglun, et al.
Published: (2023)
by: Xu, Yinglun, et al.
Published: (2023)
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
by: Liu, Shirong, et al.
Published: (2024)
by: Liu, Shirong, et al.
Published: (2024)
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
by: Hu, Hao, et al.
Published: (2025)
by: Hu, Hao, et al.
Published: (2025)
Offline Constrained Reinforcement Learning under Partial Data Coverage
by: Ko, Seokmin, et al.
Published: (2025)
by: Ko, Seokmin, et al.
Published: (2025)
Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
by: Ma, Shaocong, et al.
Published: (2025)
by: Ma, Shaocong, et al.
Published: (2025)
Behavior Preference Regression for Offline Reinforcement Learning
by: Srinivasan, Padmanaba, et al.
Published: (2025)
by: Srinivasan, Padmanaba, et al.
Published: (2025)
Offline Reinforcement Learning with Behavioral Supervisor Tuning
by: Srinivasan, Padmanaba, et al.
Published: (2024)
by: Srinivasan, Padmanaba, et al.
Published: (2024)
Model-Based Reinforcement Learning with Double Oracle Efficiency in Policy Optimization and Offline Estimation
by: Hu, Haichen, et al.
Published: (2026)
by: Hu, Haichen, et al.
Published: (2026)
Similar Items
-
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
by: Fang, Linjiajie, et al.
Published: (2024) -
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2025) -
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning
by: Xu, Linjie, et al.
Published: (2023) -
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
by: Zhang, Jing, et al.
Published: (2024) -
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
by: He, Longxiang, et al.
Published: (2023)