| Main authors: | Skalse, Joar; Abate, Alessandro |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2411.15951 |
Similar items
Partial Identifiability in Inverse Reinforcement Learning For Agents With Non-Exponential Discounting
by: Skalse, Joar, et al.
Published: (2024)
Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification
by: Skalse, Joar, et al.
Published: (2024)
On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks
by: Skalse, Joar, et al.
Published: (2024)
The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret
by: Fluri, Lukas, et al.
Published: (2024)
STARC: A General Framework For Quantifying Differences Between Reward Functions
by: Skalse, Joar, et al.
Published: (2023)
Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning
by: Melo, Luckeciano C., et al.
Published: (2025)
Probabilistic Performance Guarantees for Multi-Task Reinforcement Learning
by: Schnitzer, Yannik, et al.
Published: (2026)
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL
by: Jackermeier, Mathias, et al.
Published: (2024)
Efficient Solution and Learning of Robust Factored MDPs
by: Schnitzer, Yannik, et al.
Published: (2025)
On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning
by: Subramani, Rohan, et al.
Published: (2023)
Semantically Labelled Automata for Multi-Task Reinforcement Learning with LTL Instructions
by: Abate, Alessandro, et al.
Published: (2026)
Temporal-Difference Variational Continual Learning
by: Melo, Luckeciano C., et al.
Published: (2024)
TROFI: Trajectory-Ranked Offline Inverse Reinforcement Learning
by: Sestini, Alessandro, et al.
Published: (2025)
Efficient Imitation under Misspecification
by: Espinosa-Dice, Nicolas, et al.
Published: (2025)
Neural Proofs for Sound Verification and Control of Complex Systems
by: Abate, Alessandro
Published: (2025)
Robust Shielding for Safe Reinforcement Learning
by: Court, Edwin Hamel-De le, et al.
Published: (2026)
Priors Matter: Addressing Misspecification in Bayesian Deep Q-Learning
by: van der Vaart, Pascal R., et al.
Published: (2025)
Zero-Shot Instruction Following in RL via Structured LTL Representations
by: Giuri, Mattia, et al.
Published: (2025)
Hybrid Inverse Reinforcement Learning
by: Ren, Juntao, et al.
Published: (2024)
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
by: Luis, Carlos E., et al.
Published: (2024)
Fast Rates for Inverse Reinforcement Learning
by: Schlaginhaufen, Andreas, et al.
Published: (2026)
On the Effective Horizon of Inverse Reinforcement Learning
by: Xu, Yiqing, et al.
Published: (2023)
Environment Design for Inverse Reinforcement Learning
by: Buening, Thomas Kleine, et al.
Published: (2022)
Recursive Deep Inverse Reinforcement Learning
by: Ghanem, Paul, et al.
Published: (2025)
Inverse Reinforcement Learning With Constraint Recovery
by: Das, Nirjhar, et al.
Published: (2023)
Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols
by: Griffin, Charlie, et al.
Published: (2024)
Zero-Shot Instruction Following in RL via Structured LTL Representations
by: Jackermeier, Mathias, et al.
Published: (2026)
Certifiably Robust Policies for Uncertain Parametric Environments
by: Schnitzer, Yannik, et al.
Published: (2024)
Inverse Reinforcement Learning with Sub-optimal Experts
by: Poiani, Riccardo, et al.
Published: (2024)
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
by: Beliaev, Mark, et al.
Published: (2024)
Is Optimal Transport Necessary for Inverse Reinforcement Learning?
by: Dong, Zixuan, et al.
Published: (2025)
Kernel Density Bayesian Inverse Reinforcement Learning
by: Mandyam, Aishwarya, et al.
Published: (2023)
Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch
by: Mechergui, Malek, et al.
Published: (2024)
Towards Interpretable Deep Reinforcement Learning Models via Inverse Reinforcement Learning
by: Xie, Sean, et al.
Published: (2022)
Near-Optimal Partially Observable Reinforcement Learning with Partial Online State Information
by: Shi, Ming, et al.
Published: (2023)
Inverse Delayed Reinforcement Learning
by: Zhan, Simon Sinong, et al.
Published: (2024)
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective
by: Zhao, Lei, et al.
Published: (2023)
Inverse Reinforcement Learning from Non-Stationary Learning Agents
by: Sivakumar, Kavinayan P., et al.
Published: (2024)
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
by: Yue, Bo, et al.
Published: (2024)
Defining and Characterizing Reward Hacking
by: Skalse, Joar, et al.
Published: (2022)