Saved in:
| Main Authors: | Fang, Linjiajie, Liu, Ruoxue, Zhang, Jing, Wang, Wenjia, Jing, Bing-Yi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.20555 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
by: Zhang, Jing, et al.
Published: (2023)
by: Zhang, Jing, et al.
Published: (2023)
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
by: Zhang, Jing, et al.
Published: (2024)
by: Zhang, Jing, et al.
Published: (2024)
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
by: He, Longxiang, et al.
Published: (2023)
by: He, Longxiang, et al.
Published: (2023)
Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)
by: Liu, Tong, et al.
Published: (2025)
Flow Actor-Critic for Offline Reinforcement Learning
by: Chae, Jongseong, et al.
Published: (2026)
by: Chae, Jongseong, et al.
Published: (2026)
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning
by: Wei, Honghao, et al.
Published: (2024)
by: Wei, Honghao, et al.
Published: (2024)
Offline Actor-Critic Reinforcement Learning Scales to Large Models
by: Springenberg, Jost Tobias, et al.
Published: (2024)
by: Springenberg, Jost Tobias, et al.
Published: (2024)
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
by: Ma, Yunchang, et al.
Published: (2025)
by: Ma, Yunchang, et al.
Published: (2025)
Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare
by: Fang, Nan, et al.
Published: (2024)
by: Fang, Nan, et al.
Published: (2024)
Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
by: Jing, Tan, et al.
Published: (2025)
by: Jing, Tan, et al.
Published: (2025)
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
by: Zhang, Lunjun, et al.
Published: (2025)
by: Zhang, Lunjun, et al.
Published: (2025)
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning
by: Xu, Linjie, et al.
Published: (2023)
by: Xu, Linjie, et al.
Published: (2023)
Diffusion Policies for Risk-Averse Behavior Modeling in Offline Reinforcement Learning
by: Chen, Xiaocong, et al.
Published: (2024)
by: Chen, Xiaocong, et al.
Published: (2024)
Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
by: Zeng, Sihan, et al.
Published: (2024)
by: Zeng, Sihan, et al.
Published: (2024)
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2025)
by: Gao, Chen-Xiao, et al.
Published: (2025)
Diffusion Actor-Critic with Entropy Regulator
by: Wang, Yinuo, et al.
Published: (2024)
by: Wang, Yinuo, et al.
Published: (2024)
Proximal Action Replacement for Behavior Cloning Actor-Critic in Offline Reinforcement Learning
by: Dong, Jinzong, et al.
Published: (2026)
by: Dong, Jinzong, et al.
Published: (2026)
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)
by: Zhang, Tianle, et al.
Published: (2024)
LLM-Driven Policy Diffusion: Enhancing Generalization in Offline Reinforcement Learning
by: Zhang, Hanping, et al.
Published: (2025)
by: Zhang, Hanping, et al.
Published: (2025)
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
by: Chen, Tianyu, et al.
Published: (2024)
by: Chen, Tianyu, et al.
Published: (2024)
A Single-Loop Deep Actor-Critic Algorithm for Constrained Reinforcement Learning with Provable Convergence
by: Wang, Kexuan, et al.
Published: (2023)
by: Wang, Kexuan, et al.
Published: (2023)
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
by: Zhang, Ruoqi, et al.
Published: (2024)
by: Zhang, Ruoqi, et al.
Published: (2024)
Actor-Critic Reinforcement Learning with Phased Actor
by: Wu, Ruofan, et al.
Published: (2024)
by: Wu, Ruofan, et al.
Published: (2024)
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
by: Mao, Liyuan, et al.
Published: (2024)
by: Mao, Liyuan, et al.
Published: (2024)
Risk-sensitive Actor-Critic with Static Spectral Risk Measures for Online and Offline Reinforcement Learning
by: Moghimi, Mehrdad, et al.
Published: (2025)
by: Moghimi, Mehrdad, et al.
Published: (2025)
CODA: Coordination via On-Policy Diffusion for Multi-Agent Offline Reinforcement Learning
by: Hedman, Marcel, et al.
Published: (2026)
by: Hedman, Marcel, et al.
Published: (2026)
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
by: Liu, Xu-Hui, et al.
Published: (2024)
by: Liu, Xu-Hui, et al.
Published: (2024)
Enhanced Bayesian Personalized Ranking for Robust Hard Negative Sampling in Recommender Systems
by: Shi, Kexin, et al.
Published: (2024)
by: Shi, Kexin, et al.
Published: (2024)
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning
by: Ada, Suzan Ece, et al.
Published: (2023)
by: Ada, Suzan Ece, et al.
Published: (2023)
Double Actor-Critic with TD Error-Driven Regularization in Reinforcement Learning
by: Chen, Haohui, et al.
Published: (2024)
by: Chen, Haohui, et al.
Published: (2024)
CausalCOMRL: Context-Based Offline Meta-Reinforcement Learning with Causal Representation
by: Zhang, Zhengzhe, et al.
Published: (2025)
by: Zhang, Zhengzhe, et al.
Published: (2025)
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning
by: Hu, Xuemin, et al.
Published: (2024)
by: Hu, Xuemin, et al.
Published: (2024)
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning
by: Koirala, Prajwal, et al.
Published: (2024)
by: Koirala, Prajwal, et al.
Published: (2024)
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
by: Huang, Xiao, et al.
Published: (2025)
by: Huang, Xiao, et al.
Published: (2025)
One-Step Flow Q-Learning: Addressing the Diffusion Policy Bottleneck in Offline Reinforcement Learning
by: Nguyen, Thanh, et al.
Published: (2025)
by: Nguyen, Thanh, et al.
Published: (2025)
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms
by: Panda, Prashansa, et al.
Published: (2023)
by: Panda, Prashansa, et al.
Published: (2023)
Relative Importance Sampling for off-Policy Actor-Critic in Deep Reinforcement Learning
by: Humayoo, Mahammad, et al.
Published: (2018)
by: Humayoo, Mahammad, et al.
Published: (2018)
Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic
by: Vo, Thanh Vinh, et al.
Published: (2025)
by: Vo, Thanh Vinh, et al.
Published: (2025)
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
by: Shen, Yi, et al.
Published: (2024)
by: Shen, Yi, et al.
Published: (2024)
CCS: Controllable and Constrained Sampling with Diffusion Models via Initial Noise Perturbation
by: Song, Bowen, et al.
Published: (2025)
by: Song, Bowen, et al.
Published: (2025)
Similar Items
-
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
by: Zhang, Jing, et al.
Published: (2023) -
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
by: Zhang, Jing, et al.
Published: (2024) -
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
by: He, Longxiang, et al.
Published: (2023) -
Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025) -
Flow Actor-Critic for Offline Reinforcement Learning
by: Chae, Jongseong, et al.
Published: (2026)