| Main Authors: | Wachi, Akifumi, Shen, Xun, Sui, Yanan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2402.02025 |
Similar Items
Long-term Safe Reinforcement Learning with Binary Feedback
by: Wachi, Akifumi, et al.
Published: (2024)
A Provable Approach for End-to-End Safe Reinforcement Learning
by: Wachi, Akifumi, et al.
Published: (2025)
Target Return Optimizer for Multi-Game Decision Transformer
by: Tatematsu, Kensuke, et al.
Published: (2025)
A Relative-Budget Theory for Reinforcement Learning with Verifiable Rewards in Large Language Model Reasoning
by: Wachi, Akifumi, et al.
Published: (2026)
Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning
by: Kudo, Mikoto, et al.
Published: (2026)
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies
by: Yan, Runze, et al.
Published: (2025)
Cost-Minimized Label-Flipping Poisoning Attack to LLM Alignment
by: Kusaka, Shigeki, et al.
Published: (2025)
Stepwise Alignment for Constrained Language Model Policy Optimization
by: Wachi, Akifumi, et al.
Published: (2024)
Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing
by: Tran, Thien Q., et al.
Published: (2025)
Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints
by: Low, Siow Meng, et al.
Published: (2024)
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2024)
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2023)
SB-TRPO: Towards Safe Reinforcement Learning with Hard Constraints
by: Wagner, Dominik, et al.
Published: (2025)
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
by: Lin, Qian, et al.
Published: (2023)
Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving
by: Hu, Xuemin, et al.
Published: (2024)
Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning
by: Brahmanage, Janaka Chathuranga, et al.
Published: (2026)
Flipping-based Policy for Chance-Constrained Markov Decision Processes
by: Shen, Xun, et al.
Published: (2024)
Integrating LTL Constraints into PPO for Safe Reinforcement Learning
by: Zhang, Maifang, et al.
Published: (2026)
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
by: Shen, Yi, et al.
Published: (2024)
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles
by: Zhang, Xinglong, et al.
Published: (2021)
Reinforcement Learning by Guided Safe Exploration
by: Yang, Qisong, et al.
Published: (2023)
Probabilistic Shielding for Safe Reinforcement Learning
by: Court, Edwin Hamel-De le, et al.
Published: (2025)
SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning
by: Anisimov, Maksim, et al.
Published: (2026)
Safe Reinforcement Learning with Preference-based Constraint Inference
by: Li, Chenglin, et al.
Published: (2026)
Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning
by: Zhao, Weiye, et al.
Published: (2024)
A Harmonic Mean Formulation of Average Reward Reinforcement Learning in SMDPs
by: Shtossel, Erel, et al.
Published: (2026)
GUARD: A Safe Reinforcement Learning Benchmark
by: Zhao, Weiye, et al.
Published: (2023)
Online Optimization for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2025)
Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies
by: Tayal, Mumuksh, et al.
Published: (2026)
Sampling-Based Safe Reinforcement Learning
by: Vignola, Luca, et al.
Published: (2026)
Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning
by: Vaskov, Sean, et al.
Published: (2024)
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
by: Ji, Jiaming, et al.
Published: (2023)
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
by: Gu, Shangding, et al.
Published: (2022)
Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
by: Ji, Jiaming, et al.
Published: (2025)
Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning
by: Doan, Duc Kien, et al.
Published: (2025)
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
by: Gao, Yunkai, et al.
Published: (2025)
Offline Safe Reinforcement Learning Using Trajectory Classification
by: Gong, Ze, et al.
Published: (2024)
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
by: Kim, Dohyeong, et al.
Published: (2024)
PNAct: Crafting Backdoor Attacks in Safe Reinforcement Learning
by: Guo, Weiran, et al.
Published: (2025)
Safe Reinforcement Learning for Real-World Engine Control
by: Bedei, Julian, et al.
Published: (2025)