:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Khan, Fairoz Nower, Nahim, Nabuat Zaman, Ju, Peizhong
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.12379
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Flow Matching for Offline Reinforcement Learning with Discrete Actions
by: Khan, Fairoz Nower, et al.
Published: (2026)

Discrete MeanFlow: One-Step Generation via Conditional Transition Kernels
by: Khan, Fairoz Nower, et al.
Published: (2026)

Flow Matching with Injected Noise for Offline-to-Online Reinforcement Learning
by: Shin, Yongjae, et al.
Published: (2026)

Epigraph-Guided Flow Matching for Safe and Performant Offline Reinforcement Learning
by: Tayal, Manan, et al.
Published: (2026)

Controllable Flow Matching for Online Reinforcement Learning
by: Wang, Bin, et al.
Published: (2025)

Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual
by: Li, Yining, et al.
Published: (2026)

Offline Reinforcement Learning with Discrete Diffusion Skills
by: Qiao, RuiXi, et al.
Published: (2025)

Online Optimization for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2025)

The Three Regimes of Offline-to-Online Reinforcement Learning
by: Li, Lu, et al.
Published: (2025)

Entropy-Regularized Adjoint Matching for Offline Reinforcement Learning
by: Ghanem, Abdelghani, et al.
Published: (2026)

Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution
by: Haider, Muhammad Umair, et al.
Published: (2025)

Enhancing Bidirectional Sign Language Communication: Integrating YOLOv8 and NLP for Real-Time Gesture Recognition & Translation
by: Bhuiyan, Hasnat Jamil, et al.
Published: (2024)

Flow Actor-Critic for Offline Reinforcement Learning
by: Chae, Jongseong, et al.
Published: (2026)

Discrete Flow Matching
by: Gat, Itai, et al.
Published: (2024)

Adaptive Replay Buffer for Offline-to-Online Reinforcement Learning
by: Song, Chihyeon, et al.
Published: (2025)

PSMGD: Periodic Stochastic Multi-Gradient Descent for Fast Multi-Objective Optimization
by: Xu, Mingjing, et al.
Published: (2024)

FOVA: Offline Federated Reinforcement Learning with Mixed-Quality Data
by: Qiao, Nan, et al.
Published: (2025)

OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning
by: Yue, Sheng, et al.
Published: (2024)

Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
by: Huang, Xiao, et al.
Published: (2025)

RLSynC: Offline-Online Reinforcement Learning for Synthon Completion
by: Baker, Frazier N., et al.
Published: (2023)

SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
by: Zhang, Liyu, et al.
Published: (2024)

Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
by: Wang, Changhong, et al.
Published: (2024)

Causal Flow Q-Learning for Robust Offline Reinforcement Learning
by: Li, Mingxuan, et al.
Published: (2026)

FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
by: Alles, Marvin, et al.
Published: (2025)

ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
by: Zhao, Kai, et al.
Published: (2023)

Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies
by: Tayal, Mumuksh, et al.
Published: (2026)

Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)

SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer
by: de Lara, Nathan Samuel, et al.
Published: (2026)

A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
by: Guo, Siyuan, et al.
Published: (2023)

Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
by: Zhang, Ziqi, et al.
Published: (2023)

Selective Reincarnation: Offline-to-Online Multi-Agent Reinforcement Learning
by: Formanek, Claude, et al.
Published: (2023)

Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning
by: Tiofack, Franki Nguimatsia, et al.
Published: (2025)

FlowCritic: Bridging Value Estimation with Flow Matching in Reinforcement Learning
by: Zhong, Shan, et al.
Published: (2025)

When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
by: Niu, Haoyi, et al.
Published: (2022)

Flow-Based Policy for Online Reinforcement Learning
by: Lv, Lei, et al.
Published: (2025)

Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2026)

ROAD: Adaptive Data Mixing for Offline-to-Online Reinforcement Learning via Bi-Level Optimization
by: Yang, Letian, et al.
Published: (2026)

Preference Elicitation for Offline Reinforcement Learning
by: Pace, Alizée, et al.
Published: (2024)

Offline Reinforcement Learning with Imbalanced Datasets
by: Jiang, Li, et al.
Published: (2023)

Simple Ingredients for Offline Reinforcement Learning
by: Cetin, Edoardo, et al.
Published: (2024)