Saved in:
| Main Authors: | Lee, Sungyoung, Kim, Dohyeong, Balachandar, Eshan, Mustafaoglu, Zelal Su, Pingali, Keshav |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.01663 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization
by: Su, Zelal, et al.
Published: (2026)
by: Su, Zelal, et al.
Published: (2026)
Evolutionary Policy Optimization
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025)
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025)
Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants
by: You, Bozhi, et al.
Published: (2025)
by: You, Bozhi, et al.
Published: (2025)
ReFORM: Reflected Flows for On-support Offline RL via Noise Manipulation
by: Zhang, Songyuan, et al.
Published: (2026)
by: Zhang, Songyuan, et al.
Published: (2026)
Causal Flow Q-Learning for Robust Offline Reinforcement Learning
by: Li, Mingxuan, et al.
Published: (2026)
by: Li, Mingxuan, et al.
Published: (2026)
FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2025)
by: Alles, Marvin, et al.
Published: (2025)
DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
by: Kim, Changyeon, et al.
Published: (2025)
by: Kim, Changyeon, et al.
Published: (2025)
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning
by: Koirala, Prajwal, et al.
Published: (2025)
by: Koirala, Prajwal, et al.
Published: (2025)
FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance
by: Kim, Sungha, et al.
Published: (2026)
by: Kim, Sungha, et al.
Published: (2026)
Balancing Signal and Variance: Adaptive Offline RL Post-Training for VLA Flow Models
by: Zhang, Hongyin, et al.
Published: (2025)
by: Zhang, Hongyin, et al.
Published: (2025)
Adaptive Q-Chunking for Offline-to-Online Reinforcement Learning
by: Gireesh, Nandiraju, et al.
Published: (2026)
by: Gireesh, Nandiraju, et al.
Published: (2026)
Learning Generalizable Visuomotor Policy through Dynamics-Alignment
by: Lee, Dohyeok, et al.
Published: (2025)
by: Lee, Dohyeok, et al.
Published: (2025)
Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning
by: Nguyen, Thanh, et al.
Published: (2026)
by: Nguyen, Thanh, et al.
Published: (2026)
Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only
by: Xiao, Wei, et al.
Published: (2025)
by: Xiao, Wei, et al.
Published: (2025)
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
by: Yan, Teng, et al.
Published: (2024)
by: Yan, Teng, et al.
Published: (2024)
Diffusion Models as Optimizers for Efficient Planning in Offline RL
by: Huang, Renming, et al.
Published: (2024)
by: Huang, Renming, et al.
Published: (2024)
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
by: Zhao, Kai, et al.
Published: (2023)
by: Zhao, Kai, et al.
Published: (2023)
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning
by: Song, Yeda, et al.
Published: (2024)
by: Song, Yeda, et al.
Published: (2024)
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning
by: Rowe, Luke, et al.
Published: (2024)
by: Rowe, Luke, et al.
Published: (2024)
Robust Policy Learning via Offline Skill Diffusion
by: Kim, Woo Kyung, et al.
Published: (2024)
by: Kim, Woo Kyung, et al.
Published: (2024)
Language-Conditioned Offline RL for Multi-Robot Navigation
by: Morad, Steven, et al.
Published: (2024)
by: Morad, Steven, et al.
Published: (2024)
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
by: Wu, Kun, et al.
Published: (2024)
by: Wu, Kun, et al.
Published: (2024)
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
by: Park, Seohong, et al.
Published: (2023)
by: Park, Seohong, et al.
Published: (2023)
Learn Where Outcomes Diverge: Efficient VLA RL via Probabilistic Chunk Masking
by: Bagaria, Vaidehi, et al.
Published: (2026)
by: Bagaria, Vaidehi, et al.
Published: (2026)
SutureFormer: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space
by: Liu, Huanrong, et al.
Published: (2026)
by: Liu, Huanrong, et al.
Published: (2026)
Failure-Aware RL: Reliable Offline-to-Online Reinforcement Learning with Self-Recovery for Real-World Manipulation
by: Li, Huanyu, et al.
Published: (2026)
by: Li, Huanyu, et al.
Published: (2026)
Scaling Offline RL via Efficient and Expressive Shortcut Models
by: Espinosa-Dice, Nicolas, et al.
Published: (2025)
by: Espinosa-Dice, Nicolas, et al.
Published: (2025)
A Recipe for Stable Offline Multi-agent Reinforcement Learning
by: Lee, Dongsu, et al.
Published: (2026)
by: Lee, Dongsu, et al.
Published: (2026)
Sim-Anchored Learning for On-the-Fly Adaptation
by: Mabsout, Bassel El, et al.
Published: (2023)
by: Mabsout, Bassel El, et al.
Published: (2023)
Equivariant Offline Reinforcement Learning
by: Tangri, Arsh, et al.
Published: (2024)
by: Tangri, Arsh, et al.
Published: (2024)
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
by: Niu, Haoyi, et al.
Published: (2023)
by: Niu, Haoyi, et al.
Published: (2023)
Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning Approach
by: Kim, Dohyeong, et al.
Published: (2024)
by: Kim, Dohyeong, et al.
Published: (2024)
Q-Guided Stein Variational Model Predictive Control via RL-informed Policy Prior
by: Cai, Shizhe, et al.
Published: (2025)
by: Cai, Shizhe, et al.
Published: (2025)
Boundary-to-Region Supervision for Offline Safe Reinforcement Learning
by: Su, Huikang, et al.
Published: (2025)
by: Su, Huikang, et al.
Published: (2025)
Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning
by: Baek, Seungho, et al.
Published: (2025)
by: Baek, Seungho, et al.
Published: (2025)
Trust Region Q Adjoint Matching
by: Dong, Yonghoon, et al.
Published: (2026)
by: Dong, Yonghoon, et al.
Published: (2026)
From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning
by: Sun, Zhanyi, et al.
Published: (2026)
by: Sun, Zhanyi, et al.
Published: (2026)
Dual-Granularity Contrastive Reward via Generated Episodic Guidance for Efficient Embodied RL
by: Liu, Xin, et al.
Published: (2026)
by: Liu, Xin, et al.
Published: (2026)
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses
by: Nguyen, Thanh, et al.
Published: (2024)
by: Nguyen, Thanh, et al.
Published: (2024)
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
by: Lee, Dongsu, et al.
Published: (2025)
by: Lee, Dongsu, et al.
Published: (2025)
Similar Items
-
Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization
by: Su, Zelal, et al.
Published: (2026) -
Evolutionary Policy Optimization
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025) -
Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants
by: You, Bozhi, et al.
Published: (2025) -
ReFORM: Reflected Flows for On-support Offline RL via Noise Manipulation
by: Zhang, Songyuan, et al.
Published: (2026) -
Causal Flow Q-Learning for Robust Offline Reinforcement Learning
by: Li, Mingxuan, et al.
Published: (2026)