Saved in:
| Main Authors: | Ding, Shutong, Hu, Ke, Zhang, Zhenhao, Ren, Kan, Zhang, Weinan, Yu, Jingyi, Wang, Jingya, Shi, Ye |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.16173 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning
by: Ding, Shutong, et al.
Published: (2025)
by: Ding, Shutong, et al.
Published: (2025)
Sample-Efficient Diffusion-based Reinforcement Learning with Critic Guidance
by: Ding, Shutong, et al.
Published: (2026)
by: Ding, Shutong, et al.
Published: (2026)
Distributional Reinforcement Learning with Diffusion Bridge Critics
by: Ding, Shutong, et al.
Published: (2026)
by: Ding, Shutong, et al.
Published: (2026)
Guidance with Spherical Gaussian Constraint for Conditional Diffusion
by: Yang, Lingxiao, et al.
Published: (2024)
by: Yang, Lingxiao, et al.
Published: (2024)
DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion
by: Fan, Yahao, et al.
Published: (2025)
by: Fan, Yahao, et al.
Published: (2025)
Diffusion-based learning framework for Constrained Nonconvex Optimization with Weighted Bootstrapped Refinement
by: Ding, Shutong, et al.
Published: (2025)
by: Ding, Shutong, et al.
Published: (2025)
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
by: Sun, Mingyang, et al.
Published: (2025)
by: Sun, Mingyang, et al.
Published: (2025)
Iterative Refinement of Flow Policies in Probability Space for Online Reinforcement Learning
by: Sun, Mingyang, et al.
Published: (2025)
by: Sun, Mingyang, et al.
Published: (2025)
SCRIPT: Scalable Diffusion Policy with Multi-stage Training for Language-driven Physics-based Humanoid Control
by: Zhang, Jingyan, et al.
Published: (2026)
by: Zhang, Jingyan, et al.
Published: (2026)
Reinforcing Language Agents via Policy Optimization with Action Decomposition
by: Wen, Muning, et al.
Published: (2024)
by: Wen, Muning, et al.
Published: (2024)
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
by: Zhou, Ruiwen, et al.
Published: (2023)
by: Zhou, Ruiwen, et al.
Published: (2023)
A Unified and Fast-Sampling Diffusion Bridge Framework via Stochastic Optimal Control
by: Pan, Mokai, et al.
Published: (2025)
by: Pan, Mokai, et al.
Published: (2025)
Sample from What You See: Visuomotor Policy Learning via Diffusion Bridge with Observation-Embedded Stochastic Differential Equation
by: Liu, Zhaoyang, et al.
Published: (2025)
by: Liu, Zhaoyang, et al.
Published: (2025)
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
by: Mao, Liyuan, et al.
Published: (2024)
by: Mao, Liyuan, et al.
Published: (2024)
A Review of Online Diffusion Policy RL Algorithms for Scalable Robotic Control
by: Choi, Wonhyeok, et al.
Published: (2026)
by: Choi, Wonhyeok, et al.
Published: (2026)
Path-Space Mirror Descent for On-Policy Reinforcement Learning under the Generalized Schrödinger Bridge
by: Gong, Yuehu, et al.
Published: (2026)
by: Gong, Yuehu, et al.
Published: (2026)
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
by: Zhang, Ruoqi, et al.
Published: (2024)
by: Zhang, Ruoqi, et al.
Published: (2024)
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
by: Li, Guanghe, et al.
Published: (2024)
by: Li, Guanghe, et al.
Published: (2024)
Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects
by: Wang, Xihuai, et al.
Published: (2022)
by: Wang, Xihuai, et al.
Published: (2022)
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2025)
by: Gao, Chen-Xiao, et al.
Published: (2025)
DyDiff: Long-Horizon Rollout via Dynamics Diffusion for Offline Reinforcement Learning
by: Zhao, Hanye, et al.
Published: (2024)
by: Zhao, Hanye, et al.
Published: (2024)
UniHM: Unified Dexterous Hand Manipulation with Vision Language Model
by: Zhang, Zhenhao, et al.
Published: (2026)
by: Zhang, Zhenhao, et al.
Published: (2026)
Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement
by: Wen, Muning, et al.
Published: (2024)
by: Wen, Muning, et al.
Published: (2024)
Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability
by: Shi, Yingdong, et al.
Published: (2025)
by: Shi, Yingdong, et al.
Published: (2025)
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
by: Ma, Yunchang, et al.
Published: (2025)
by: Ma, Yunchang, et al.
Published: (2025)
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
by: Wu, Shutong, et al.
Published: (2025)
by: Wu, Shutong, et al.
Published: (2025)
CausalGDP: Causality-Guided Diffusion Policies for Reinforcement Learning
by: Xiao, Xiaofeng, et al.
Published: (2026)
by: Xiao, Xiaofeng, et al.
Published: (2026)
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2023)
by: Yao, Yihang, et al.
Published: (2023)
Diffusion Models for Reinforcement Learning: A Survey
by: Zhu, Zhengbang, et al.
Published: (2023)
by: Zhu, Zhengbang, et al.
Published: (2023)
Global and Local Prompts Cooperation via Optimal Transport for Federated Learning
by: Li, Hongxia, et al.
Published: (2024)
by: Li, Hongxia, et al.
Published: (2024)
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training
by: He, Haoran, et al.
Published: (2024)
by: He, Haoran, et al.
Published: (2024)
PADiff: Predictive and Adaptive Diffusion Policies for Ad Hoc Teamwork
by: Chan, Hohei, et al.
Published: (2025)
by: Chan, Hohei, et al.
Published: (2025)
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)
by: Zhang, Tianle, et al.
Published: (2024)
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
by: Liu, Zongkai, et al.
Published: (2024)
by: Liu, Zongkai, et al.
Published: (2024)
Harmonizing Generalization and Personalization in Federated Prompt Learning
by: Cui, Tianyu, et al.
Published: (2024)
by: Cui, Tianyu, et al.
Published: (2024)
Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay
by: Chen, Feng, et al.
Published: (2024)
by: Chen, Feng, et al.
Published: (2024)
Q-Flow: Stable and Expressive Reinforcement Learning with Flow-Based Policy
by: Doo, JaeHyeok, et al.
Published: (2026)
by: Doo, JaeHyeok, et al.
Published: (2026)
THOR: Text to Human-Object Interaction Diffusion via Relation Intervention
by: Wu, Qianyang, et al.
Published: (2024)
by: Wu, Qianyang, et al.
Published: (2024)
AffordDP: Generalizable Diffusion Policy with Transferable Affordance
by: Wu, Shijie, et al.
Published: (2024)
by: Wu, Shijie, et al.
Published: (2024)
One-Step Flow Q-Learning: Addressing the Diffusion Policy Bottleneck in Offline Reinforcement Learning
by: Nguyen, Thanh, et al.
Published: (2025)
by: Nguyen, Thanh, et al.
Published: (2025)
Similar Items
-
GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning
by: Ding, Shutong, et al.
Published: (2025) -
Sample-Efficient Diffusion-based Reinforcement Learning with Critic Guidance
by: Ding, Shutong, et al.
Published: (2026) -
Distributional Reinforcement Learning with Diffusion Bridge Critics
by: Ding, Shutong, et al.
Published: (2026) -
Guidance with Spherical Gaussian Constraint for Conditional Diffusion
by: Yang, Lingxiao, et al.
Published: (2024) -
DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion
by: Fan, Yahao, et al.
Published: (2025)