| Main Authors: | Bai, Wensong, Zhang, Chao, Xu, Qihang, Chen, Chufan, Zhou, Chenhao, Qian, Hui |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.08584 |
Similar Items
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm
by: Bai, Wensong, et al.
Published: (2023)
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
by: Ying, Chengyang, et al.
Published: (2022)
Off-Policy Primal-Dual Safe Reinforcement Learning
by: Wu, Zifan, et al.
Published: (2024)
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2024)
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning
by: Feng, Meng, et al.
Published: (2025)
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
by: Guo, Zijian, et al.
Published: (2024)
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2023)
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
by: Lin, Qian, et al.
Published: (2023)
Safe In-Context Reinforcement Learning
by: Moeini, Amir, et al.
Published: (2025)
Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty
by: Wan, Xu, et al.
Published: (2026)
SafeDreamer: Safe Reinforcement Learning with World Models
by: Huang, Weidong, et al.
Published: (2023)
Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding
by: Gao, Chufan, et al.
Published: (2025)
Integrating Neural Differential Forecasting with Safe Reinforcement Learning for Blood Glucose Regulation
by: Liu, Yushen, et al.
Published: (2025)
Interpret Policies in Deep Reinforcement Learning using SILVER with RL-Guided Labeling: A Model-level Approach to High-dimensional and Multi-action Environments
by: Qian, Yiyu, et al.
Published: (2025)
Policy Bifurcation in Safe Reinforcement Learning
by: Zou, Wenjun, et al.
Published: (2024)
MoveLight: Enhancing Traffic Signal Control through Movement-Centric Deep Reinforcement Learning
by: Shao, Junqi, et al.
Published: (2024)
Feasible Policy Iteration for Safe Reinforcement Learning
by: Yang, Yujie, et al.
Published: (2023)
Counterfactually Safe Reinforcement Learning
by: Li, Jingyi, et al.
Published: (2026)
Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
by: Ji, Jiaming, et al.
Published: (2025)
Inexact Moreau Envelope Lagrangian Method for Non-Convex Constrained Optimization under Local Error Bound Conditions on Constraint Functions
by: Huang, Yankun, et al.
Published: (2025)
Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents
by: Zhang, Menglong, et al.
Published: (2024)
Verified Safe Reinforcement Learning for Neural Network Dynamic Models
by: Wu, Junlin, et al.
Published: (2024)
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
by: Dai, Juntao, et al.
Published: (2024)
Extreme Value Policy Optimization for Safe Reinforcement Learning
by: Gao, Shiqing, et al.
Published: (2026)
Integrating LTL Constraints into PPO for Safe Reinforcement Learning
by: Zhang, Maifang, et al.
Published: (2026)
Decoupled Guidance Diffusion for Adaptive Offline Safe Reinforcement Learning
by: Chen, Rufeng, et al.
Published: (2026)
Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
by: Li, Chenhao, et al.
Published: (2025)
Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning
by: Brahmanage, Janaka Chathuranga, et al.
Published: (2026)
PNAct: Crafting Backdoor Attacks in Safe Reinforcement Learning
by: Guo, Weiran, et al.
Published: (2025)
Safe Deep Model-Based Reinforcement Learning with Lyapunov Functions
by: Zhang, Harry
Published: (2024)
Safe, Efficient, and Robust Reinforcement Learning for Ranking and Diffusion Models
by: Gupta, Shashank
Published: (2025)
DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises
by: Ye, Jiasheng, et al.
Published: (2023)
PIGDreamer: Privileged Information Guided World Models for Safe Partially Observable Reinforcement Learning
by: Huang, Dongchi, et al.
Published: (2025)
Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling
by: Huang, Sili, et al.
Published: (2024)
CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving
by: Booher, Jonathan, et al.
Published: (2024)
$\mathrm{E^{2}CFD}$: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
by: Wang, Zepeng, et al.
Published: (2024)
Context-Former: Stitching via Latent Conditioned Sequence Modeling
by: Zhang, Ziqi, et al.
Published: (2024)
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
by: Huang, Suning, et al.
Published: (2024)
Model-Based Reinforcement Learning for Control under Time-Varying Dynamics
by: Iten, Klemens, et al.
Published: (2026)
Adaptive Coarse-to-Fine Subgoal Refinement for Long-Horizon Offline Goal-Conditioned Reinforcement Learning
by: Ke, Kaiqiang, et al.
Published: (2026)