| Main Authors: | Wu, Haochen, Sharma, Shubham, Patra, Sunandita, Gopalakrishnan, Sriram |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Online Access: | https://arxiv.org/abs/2308.12367 |
Similar Items
QBD-RankedDataGen: Generating Custom Ranked Datasets for Improving Query-By-Document Search Using LLM-Reranking with Reduced Human Effort
by: Gopalakrishnan, Sriram, et al.
Published: (2025)
SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning
by: Anisimov, Maksim, et al.
Published: (2026)
The Importance of Time in Causal Algorithmic Recourse
by: Beretta, Isacco, et al.
Published: (2023)
Reinforcement Learning for Durable Algorithmic Recourse
by: Ceccon, Marina, et al.
Published: (2025)
Personalized Algorithmic Recourse with Preference Elicitation
by: De Toni, Giovanni, et al.
Published: (2022)
Causal Algorithmic Recourse: Foundations and Methods
by: Plecko, Drago, et al.
Published: (2026)
Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning
by: Zhao, Weiye, et al.
Published: (2024)
Safe Deep Policy Adaptation
by: Xiao, Wenli, et al.
Published: (2023)
SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2025)
Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies
by: Tayal, Mumuksh, et al.
Published: (2026)
Safe Exploration via Policy Priors
by: Wendl, Manuel, et al.
Published: (2026)
Verification-Guided Falsification for Safe RL via Explainable Abstraction and Risk-Aware Exploration
by: Le, Tuan, et al.
Published: (2025)
Rating Multi-Modal Time-Series Forecasting Models (MM-TSFM) for Robustness Through a Causal Lens
by: Lakkaraju, Kausik, et al.
Published: (2024)
From Universal to Individualized Actionability: Revisiting Personalization in Algorithmic Recourse
by: Budde, Lena Marie, et al.
Published: (2026)
Skill-based Safe Reinforcement Learning with Risk Planning
by: Zhang, Hanping, et al.
Published: (2025)
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
by: Kim, Dohyeong, et al.
Published: (2024)
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2023)
Deep SPI: Safe Policy Improvement via World Models
by: Delgrange, Florent, et al.
Published: (2025)
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2024)
Creating a Causally Grounded Rating Method for Assessing the Robustness of AI Models for Time-Series Forecasting
by: Lakkaraju, Kausik, et al.
Published: (2025)
Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving
by: Li, Dianzhao, et al.
Published: (2025)
Revisiting Safe Exploration in Safe Reinforcement learning
by: Eckel, David, et al.
Published: (2024)
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
by: Chen, Keru, et al.
Published: (2024)
RAPO: Risk-Aware Preference Optimization for Generalizable Safe Reasoning
by: Wei, Zeming, et al.
Published: (2026)
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
by: Ying, Chengyang, et al.
Published: (2022)
Pareto Optimal Algorithmic Recourse in Multi-cost Function
by: Chen, Wen-Ling, et al.
Published: (2025)
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
by: Geng, Xue, et al.
Published: (2024)
Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control
by: Chittepu, Yaswanth, et al.
Published: (2026)
Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
by: Ji, Jiaming, et al.
Published: (2025)
CAPSULE: Control-Theoretic Action Perturbations for Safe Uncertainty-Aware Reinforcement Learning
by: Narava, Rahul, et al.
Published: (2026)
Iterative Batch Reinforcement Learning via Safe Diversified Model-based Policy Search
by: Najib, Amna, et al.
Published: (2024)
Tail-Risk-Safe Monte Carlo Tree Search under PAC-Level Guarantees
by: Zhang, Zuyuan, et al.
Published: (2025)
Verified Safe Reinforcement Learning for Neural Network Dynamic Models
by: Wu, Junlin, et al.
Published: (2024)
Personalized Path Recourse for Reinforcement Learning Agents
by: Hong, Dat, et al.
Published: (2023)
Model-Based Proactive Cost Generation for Learning Safe Policies Offline with Limited Violation Data
by: Xue, Ruiqi, et al.
Published: (2026)
OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2026)
LIBRA: Language Model Informed Bandit Recourse Algorithm for Personalized Treatment Planning
by: Cao, Junyu, et al.
Published: (2026)
Reinforcement Learning by Guided Safe Exploration
by: Yang, Qisong, et al.
Published: (2023)
Information-Theoretic Safe Bayesian Optimization
by: Bottero, Alessandro G., et al.
Published: (2024)
On the Mathematical Impossibility of Safe Universal Approximators
by: Yao, Jasper
Published: (2025)