Saved in:
| Main Authors: | Dong, Zhiqiang, Pang, Teng, Xu, Rongjian, Wu, Guoqiang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.08960 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Value-Guidance MeanFlow for Offline Multi-Agent Reinforcement Learning
by: Pang, Teng, et al.
Published: (2026)
by: Pang, Teng, et al.
Published: (2026)
Equivariant Efficient Joint Discrete and Continuous MeanFlow for Molecular Graph Generation
by: Xu, Rongjian, et al.
Published: (2026)
by: Xu, Rongjian, et al.
Published: (2026)
Diffusion Classifier-Driven Reward for Offline Preference-based Reinforcement Learning
by: Pang, Teng, et al.
Published: (2025)
by: Pang, Teng, et al.
Published: (2025)
Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations
by: Myers, Vivek, et al.
Published: (2025)
by: Myers, Vivek, et al.
Published: (2025)
Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
by: Wang, Mianchu, et al.
Published: (2023)
by: Wang, Mianchu, et al.
Published: (2023)
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
by: Wang, Mianchu, et al.
Published: (2023)
by: Wang, Mianchu, et al.
Published: (2023)
QHyer: Q-conditioned Hybrid Attention-mamba Transformer for Offline Goal-conditioned RL
by: Lei, Xing, et al.
Published: (2026)
by: Lei, Xing, et al.
Published: (2026)
PIQL: Projective Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning
by: Han, Xinchen, et al.
Published: (2025)
by: Han, Xinchen, et al.
Published: (2025)
Exclusively Penalized Q-learning for Offline Reinforcement Learning
by: Yeom, Junghyuk, et al.
Published: (2024)
by: Yeom, Junghyuk, et al.
Published: (2024)
Towards Efficient and Expressive Offline RL via Flow-Anchored Noise-conditioned Q-Learning
by: Lee, Sungyoung, et al.
Published: (2026)
by: Lee, Sungyoung, et al.
Published: (2026)
Causal Flow Q-Learning for Robust Offline Reinforcement Learning
by: Li, Mingxuan, et al.
Published: (2026)
by: Li, Mingxuan, et al.
Published: (2026)
FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2025)
by: Alles, Marvin, et al.
Published: (2025)
Goal-Conditioned Data Augmentation for Offline Reinforcement Learning
by: Huang, Xingshuai, et al.
Published: (2024)
by: Huang, Xingshuai, et al.
Published: (2024)
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
by: Yan, Teng, et al.
Published: (2024)
by: Yan, Teng, et al.
Published: (2024)
Data-Efficient Hierarchical Goal-Conditioned Reinforcement Learning via Normalizing Flows
by: Garg, Shaswat, et al.
Published: (2026)
by: Garg, Shaswat, et al.
Published: (2026)
Abstraction for Offline Goal-Conditioned Reinforcement Learning
by: Wibault, Clarisse, et al.
Published: (2026)
by: Wibault, Clarisse, et al.
Published: (2026)
Imagination-Limited Q-Learning for Offline Reinforcement Learning
by: Liu, Wenhui, et al.
Published: (2025)
by: Liu, Wenhui, et al.
Published: (2025)
In-Context Compositional Q-Learning for Offline Reinforcement Learning
by: Xu, Qiushui, et al.
Published: (2025)
by: Xu, Qiushui, et al.
Published: (2025)
Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies
by: Tayal, Mumuksh, et al.
Published: (2026)
by: Tayal, Mumuksh, et al.
Published: (2026)
Reachability Weighted Offline Goal-conditioned Resampling
by: Yang, Wenyan, et al.
Published: (2025)
by: Yang, Wenyan, et al.
Published: (2025)
MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning
by: Lei, Xing, et al.
Published: (2024)
by: Lei, Xing, et al.
Published: (2024)
One-Step Flow Q-Learning: Addressing the Diffusion Policy Bottleneck in Offline Reinforcement Learning
by: Nguyen, Thanh, et al.
Published: (2025)
by: Nguyen, Thanh, et al.
Published: (2025)
Latent Representation Alignment for Offline Goal-Conditioned Reinforcement Learning
by: Kang, Hyungkyu, et al.
Published: (2026)
by: Kang, Hyungkyu, et al.
Published: (2026)
Test-time Offline Reinforcement Learning on Goal-related Experience
by: Bagatella, Marco, et al.
Published: (2025)
by: Bagatella, Marco, et al.
Published: (2025)
Offline Goal-Conditioned Reinforcement Learning with Projective Quasimetric Planning
by: Kobanda, Anthony, et al.
Published: (2025)
by: Kobanda, Anthony, et al.
Published: (2025)
Q-value Regularized Transformer for Offline Reinforcement Learning
by: Hu, Shengchao, et al.
Published: (2024)
by: Hu, Shengchao, et al.
Published: (2024)
Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)
by: Manenti, Massimiliano, et al.
Published: (2025)
Mildly Conservative Q-Learning for Offline Reinforcement Learning
by: Lyu, Jiafei, et al.
Published: (2022)
by: Lyu, Jiafei, et al.
Published: (2022)
Compositional Transduction with Latent Analogies for Offline Goal-Conditioned Reinforcement Learning
by: Kim, Junseok, et al.
Published: (2026)
by: Kim, Junseok, et al.
Published: (2026)
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
by: Giammarino, Vittorio, et al.
Published: (2025)
by: Giammarino, Vittorio, et al.
Published: (2025)
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
by: Wu, Kun, et al.
Published: (2024)
by: Wu, Kun, et al.
Published: (2024)
Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning
by: Park, Jongchan, et al.
Published: (2025)
by: Park, Jongchan, et al.
Published: (2025)
Adaptive Q-Chunking for Offline-to-Online Reinforcement Learning
by: Gireesh, Nandiraju, et al.
Published: (2026)
by: Gireesh, Nandiraju, et al.
Published: (2026)
Chain-of-Goals Hierarchical Policy for Long-Horizon Offline Goal-Conditioned RL
by: Choi, Jinwoo, et al.
Published: (2026)
by: Choi, Jinwoo, et al.
Published: (2026)
Occupancy Reward Shaping: Improving Credit Assignment for Offline Goal-Conditioned Reinforcement Learning
by: Venugopal, Aravind, et al.
Published: (2026)
by: Venugopal, Aravind, et al.
Published: (2026)
Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning
by: Kobanda, Anthony, et al.
Published: (2024)
by: Kobanda, Anthony, et al.
Published: (2024)
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
by: Sikchi, Harshit, et al.
Published: (2023)
by: Sikchi, Harshit, et al.
Published: (2023)
Adaptive Coarse-to-Fine Subgoal Refinement for Long-Horizon Offline Goal-Conditioned Reinforcement Learning
by: Ke, Kaiqiang, et al.
Published: (2026)
by: Ke, Kaiqiang, et al.
Published: (2026)
Deep Transfer $Q$-Learning for Offline Non-Stationary Reinforcement Learning
by: Chai, Jinhang, et al.
Published: (2025)
by: Chai, Jinhang, et al.
Published: (2025)
Adaptive $Q$-Aid for Conditional Supervised Learning in Offline Reinforcement Learning
by: Kim, Jeonghye, et al.
Published: (2024)
by: Kim, Jeonghye, et al.
Published: (2024)
Similar Items
-
Value-Guidance MeanFlow for Offline Multi-Agent Reinforcement Learning
by: Pang, Teng, et al.
Published: (2026) -
Equivariant Efficient Joint Discrete and Continuous MeanFlow for Molecular Graph Generation
by: Xu, Rongjian, et al.
Published: (2026) -
Diffusion Classifier-Driven Reward for Offline Preference-based Reinforcement Learning
by: Pang, Teng, et al.
Published: (2025) -
Offline Goal-conditioned Reinforcement Learning with Quasimetric Representations
by: Myers, Vivek, et al.
Published: (2025) -
Goal-conditioned Offline Reinforcement Learning through State Space Partitioning
by: Wang, Mianchu, et al.
Published: (2023)