:: Library Catalog

Bibliographic Details
Main Authors:	Lee, Sungyoung; Kim, Dohyeong; Balachandar, Eshan; Mustafaoglu, Zelal Su; Pingali, Keshav
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Robotics
Online Access:	https://arxiv.org/abs/2605.01663

Similar Items

Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization
by: Su, Zelal, et al.
Published: (2026)

Evolutionary Policy Optimization
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025)

Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants
by: You, Bozhi, et al.
Published: (2025)

ReFORM: Reflected Flows for On-support Offline RL via Noise Manipulation
by: Zhang, Songyuan, et al.
Published: (2026)

Causal Flow Q-Learning for Robust Offline Reinforcement Learning
by: Li, Mingxuan, et al.
Published: (2026)

FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2025)

DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
by: Kim, Changyeon, et al.
Published: (2025)

Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning
by: Koirala, Prajwal, et al.
Published: (2025)

FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance
by: Kim, Sungha, et al.
Published: (2026)

Balancing Signal and Variance: Adaptive Offline RL Post-Training for VLA Flow Models
by: Zhang, Hongyin, et al.
Published: (2025)

Adaptive Q-Chunking for Offline-to-Online Reinforcement Learning
by: Gireesh, Nandiraju, et al.
Published: (2026)

Learning Generalizable Visuomotor Policy through Dynamics-Alignment
by: Lee, Dohyeok, et al.
Published: (2025)

Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning
by: Nguyen, Thanh, et al.
Published: (2026)

Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only
by: Xiao, Wei, et al.
Published: (2025)

Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
by: Yan, Teng, et al.
Published: (2024)

Diffusion Models as Optimizers for Efficient Planning in Offline RL
by: Huang, Renming, et al.
Published: (2024)

ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
by: Zhao, Kai, et al.
Published: (2023)

Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning
by: Song, Yeda, et al.
Published: (2024)

CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning
by: Rowe, Luke, et al.
Published: (2024)

Robust Policy Learning via Offline Skill Diffusion
by: Kim, Woo Kyung, et al.
Published: (2024)

Language-Conditioned Offline RL for Multi-Robot Navigation
by: Morad, Steven, et al.
Published: (2024)

ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
by: Wu, Kun, et al.
Published: (2024)

HIQL: Offline Goal-Conditioned RL with Latent States as Actions
by: Park, Seohong, et al.
Published: (2023)

Learn Where Outcomes Diverge: Efficient VLA RL via Probabilistic Chunk Masking
by: Bagaria, Vaidehi, et al.
Published: (2026)

SutureFormer: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space
by: Liu, Huanrong, et al.
Published: (2026)

Failure-Aware RL: Reliable Offline-to-Online Reinforcement Learning with Self-Recovery for Real-World Manipulation
by: Li, Huanyu, et al.
Published: (2026)

Scaling Offline RL via Efficient and Expressive Shortcut Models
by: Espinosa-Dice, Nicolas, et al.
Published: (2025)

A Recipe for Stable Offline Multi-agent Reinforcement Learning
by: Lee, Dongsu, et al.
Published: (2026)

Sim-Anchored Learning for On-the-Fly Adaptation
by: Mabsout, Bassel El, et al.
Published: (2023)

Equivariant Offline Reinforcement Learning
by: Tangri, Arsh, et al.
Published: (2024)

H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps
by: Niu, Haoyi, et al.
Published: (2023)

Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning Approach
by: Kim, Dohyeong, et al.
Published: (2024)

Q-Guided Stein Variational Model Predictive Control via RL-informed Policy Prior
by: Cai, Shizhe, et al.
Published: (2025)

Boundary-to-Region Supervision for Offline Safe Reinforcement Learning
by: Su, Huikang, et al.
Published: (2025)

Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning
by: Baek, Seungho, et al.
Published: (2025)

Trust Region Q Adjoint Matching
by: Dong, Yonghoon, et al.
Published: (2026)

From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning
by: Sun, Zhanyi, et al.
Published: (2026)

Dual-Granularity Contrastive Reward via Generated Episodic Guidance for Efficient Embodied RL
by: Liu, Xin, et al.
Published: (2026)

Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses
by: Nguyen, Thanh, et al.
Published: (2024)

Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
by: Lee, Dongsu, et al.
Published: (2025)