:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Bai, Wensong, Zhang, Chao, Xu, Qihang, Chen, Chufan, Zhou, Chenhao, Qian, Hui
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.08584
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm
by: Bai, Wensong, et al.
Published: (2023)

Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
by: Ying, Chengyang, et al.
Published: (2022)

Off-Policy Primal-Dual Safe Reinforcement Learning
by: Wu, Zifan, et al.
Published: (2024)

OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2024)

Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
by: Feng, Meng, et al.
Published: (2025)

Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
by: Guo, Zijian, et al.
Published: (2024)

Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2023)

Safe Offline Reinforcement Learning with Real-Time Budget Constraints
by: Lin, Qian, et al.
Published: (2023)

Safe In-Context Reinforcement Learning
by: Moeini, Amir, et al.
Published: (2025)

Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty
by: Wan, Xu, et al.
Published: (2026)

SafeDreamer: Safe Reinforcement Learning with World Models
by: Huang, Weidong, et al.
Published: (2023)

Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding
by: Gao, Chufan, et al.
Published: (2025)

Integrating Neural Differential Forecasting with Safe Reinforcement Learning for Blood Glucose Regulation
by: Liu, Yushen, et al.
Published: (2025)

Interpret Policies in Deep Reinforcement Learning using SILVER with RL-Guided Labeling: A Model-level Approach to High-dimensional and Multi-action Environments
by: Qian, Yiyu, et al.
Published: (2025)

Policy Bifurcation in Safe Reinforcement Learning
by: Zou, Wenjun, et al.
Published: (2024)

MoveLight: Enhancing Traffic Signal Control through Movement-Centric Deep Reinforcement Learning
by: Shao, Junqi, et al.
Published: (2024)

Feasible Policy Iteration for Safe Reinforcement Learning
by: Yang, Yujie, et al.
Published: (2023)

Counterfactually Safe Reinforcement Learning
by: Li, Jingyi, et al.
Published: (2026)

Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
by: Ji, Jiaming, et al.
Published: (2025)

Inexact Moreau Envelope Lagrangian Method for Non-Convex Constrained Optimization under Local Error Bound Conditions on Constraint Functions
by: Huang, Yankun, et al.
Published: (2025)

Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents
by: Zhang, Menglong, et al.
Published: (2024)

Verified Safe Reinforcement Learning for Neural Network Dynamic Models
by: Wu, Junlin, et al.
Published: (2024)

Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
by: Dai, Juntao, et al.
Published: (2024)

Extreme Value Policy Optimization for Safe Reinforcement Learning
by: Gao, Shiqing, et al.
Published: (2026)

Integrating LTL Constraints into PPO for Safe Reinforcement Learning
by: Zhang, Maifang, et al.
Published: (2026)

Decoupled Guidance Diffusion for Adaptive Offline Safe Reinforcement Learning
by: Chen, Rufeng, et al.
Published: (2026)

Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
by: Li, Chenhao, et al.
Published: (2025)

Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning
by: Brahmanage, Janaka Chathuranga, et al.
Published: (2026)

PNAct: Crafting Backdoor Attacks in Safe Reinforcement Learning
by: Guo, Weiran, et al.
Published: (2025)

Safe Deep Model-Based Reinforcement Learning with Lyapunov Functions
by: Zhang, Harry
Published: (2024)

Safe, Efficient, and Robust Reinforcement Learning for Ranking and Diffusion Models
by: Gupta, Shashank
Published: (2025)

DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises
by: Ye, Jiasheng, et al.
Published: (2023)

PIGDreamer: Privileged Information Guided World Models for Safe Partially Observable Reinforcement Learning
by: Huang, Dongchi, et al.
Published: (2025)

Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling
by: Huang, Sili, et al.
Published: (2024)

CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving
by: Booher, Jonathan, et al.
Published: (2024)

$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
by: Wang, Zepeng, et al.
Published: (2024)

Context-Former: Stitching via Latent Conditioned Sequence Modeling
by: Zhang, Ziqi, et al.
Published: (2024)

MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
by: Huang, Suning, et al.
Published: (2024)

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics
by: Iten, Klemens, et al.
Published: (2026)

Adaptive Coarse-to-Fine Subgoal Refinement for Long-Horizon Offline Goal-Conditioned Reinforcement Learning
by: Ke, Kaiqiang, et al.
Published: (2026)