:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wu, Haochen, Sharma, Shubham, Patra, Sunandita, Gopalakrishnan, Sriram
Format:	Preprint
Published:	2023
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2308.12367
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

QBD-RankedDataGen: Generating Custom Ranked Datasets for Improving Query-By-Document Search Using LLM-Reranking with Reduced Human Effort
by: Gopalakrishnan, Sriram, et al.
Published: (2025)

SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning
by: Anisimov, Maksim, et al.
Published: (2026)

The Importance of Time in Causal Algorithmic Recourse
by: Beretta, Isacco, et al.
Published: (2023)

Reinforcement Learning for Durable Algorithmic Recourse
by: Ceccon, Marina, et al.
Published: (2025)

Personalized Algorithmic Recourse with Preference Elicitation
by: De Toni, Giovanni, et al.
Published: (2022)

Causal Algorithmic Recourse: Foundations and Methods
by: Plecko, Drago, et al.
Published: (2026)

Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning
by: Zhao, Weiye, et al.
Published: (2024)

Safe Deep Policy Adaptation
by: Xiao, Wenli, et al.
Published: (2023)

SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2025)

Safe Flow Q-Learning: Offline Safe Reinforcement Learning with Reachability-Based Flow Policies
by: Tayal, Mumuksh, et al.
Published: (2026)

Safe Exploration via Policy Priors
by: Wendl, Manuel, et al.
Published: (2026)

Verification-Guided Falsification for Safe RL via Explainable Abstraction and Risk-Aware Exploration
by: Le, Tuan, et al.
Published: (2025)

Rating Multi-Modal Time-Series Forecasting Models (MM-TSFM) for Robustness Through a Causal Lens
by: Lakkaraju, Kausik, et al.
Published: (2024)

From Universal to Individualized Actionability: Revisiting Personalization in Algorithmic Recourse
by: Budde, Lena Marie, et al.
Published: (2026)

Skill-based Safe Reinforcement Learning with Risk Planning
by: Zhang, Hanping, et al.
Published: (2025)

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
by: Kim, Dohyeong, et al.
Published: (2024)

Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2023)

Deep SPI: Safe Policy Improvement via World Models
by: Delgrange, Florent, et al.
Published: (2025)

Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2024)

Creating a Causally Grounded Rating Method for Assessing the Robustness of AI Models for Time-Series Forecasting
by: Lakkaraju, Kausik, et al.
Published: (2025)

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving
by: Li, Dianzhao, et al.
Published: (2025)

Revisiting Safe Exploration in Safe Reinforcement learning
by: Eckel, David, et al.
Published: (2024)

Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
by: Chen, Keru, et al.
Published: (2024)

RAPO: Risk-Aware Preference Optimization for Generalizable Safe Reasoning
by: Wei, Zeming, et al.
Published: (2026)

Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
by: Ying, Chengyang, et al.
Published: (2022)

Pareto Optimal Algorithmic Recourse in Multi-cost Function
by: Chen, Wen-Ling, et al.
Published: (2025)

From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
by: Geng, Xue, et al.
Published: (2024)

Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control
by: Chittepu, Yaswanth, et al.
Published: (2026)

Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
by: Ji, Jiaming, et al.
Published: (2025)

CAPSULE: Control-Theoretic Action Perturbations for Safe Uncertainty-Aware Reinforcement Learning
by: Narava, Rahul, et al.
Published: (2026)

Iterative Batch Reinforcement Learning via Safe Diversified Model-based Policy Search
by: Najib, Amna, et al.
Published: (2024)

Tail-Risk-Safe Monte Carlo Tree Search under PAC-Level Guarantees
by: Zhang, Zuyuan, et al.
Published: (2025)

Verified Safe Reinforcement Learning for Neural Network Dynamic Models
by: Wu, Junlin, et al.
Published: (2024)

Personalized Path Recourse for Reinforcement Learning Agents
by: Hong, Dat, et al.
Published: (2023)

Model-Based Proactive Cost Generation for Learning Safe Policies Offline with Limited Violation Data
by: Xue, Ruiqi, et al.
Published: (2026)

OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories
by: Burnwal, Returaj, et al.
Published: (2026)

LIBRA: Language Model Informed Bandit Recourse Algorithm for Personalized Treatment Planning
by: Cao, Junyu, et al.
Published: (2026)

Reinforcement Learning by Guided Safe Exploration
by: Yang, Qisong, et al.
Published: (2023)

Information-Theoretic Safe Bayesian Optimization
by: Bottero, Alessandro G., et al.
Published: (2024)

On the Mathematical Impossibility of Safe Universal Approximators
by: Yao, Jasper
Published: (2025)