Saved in:
| Main Authors: | Bhamidipaty, Logan Mondal, Whitammer, Esmeralda S., Abel, David, Kochenderfer, Mykel J., Ramamoorthy, Subramanian |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.15960 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Repairing Reward Functions with Feedback to Mitigate Reward Hacking
by: Hatgis-Kessell, Stephane, et al.
Published: (2025)
by: Hatgis-Kessell, Stephane, et al.
Published: (2025)
Scaling Recurrent Neural Networks to a Billion Parameters with Zero-Order Optimization
by: Chaubard, Francois, et al.
Published: (2025)
by: Chaubard, Francois, et al.
Published: (2025)
Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering
by: Chaubard, Francois, et al.
Published: (2024)
by: Chaubard, Francois, et al.
Published: (2024)
Failure Probability Estimation for Black-Box Autonomous Systems using State-Dependent Importance Sampling Proposals
by: Delecki, Harrison, et al.
Published: (2024)
by: Delecki, Harrison, et al.
Published: (2024)
Graph Q-Learning for Combinatorial Optimization
by: Dax, Victoria M., et al.
Published: (2024)
by: Dax, Victoria M., et al.
Published: (2024)
Adaptive Splitting of Reusable Temporal Monitors for Rare Traffic Violations
by: Innes, Craig, et al.
Published: (2024)
by: Innes, Craig, et al.
Published: (2024)
Learning from Demonstration with Implicit Nonlinear Dynamics Models
by: Fagan, Peter David, et al.
Published: (2024)
by: Fagan, Peter David, et al.
Published: (2024)
Zono-Conformal Prediction: Zonotope-Based Uncertainty Quantification for Regression and Classification Tasks
by: Lützow, Laura, et al.
Published: (2025)
by: Lützow, Laura, et al.
Published: (2025)
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
by: Reuel, Anka, et al.
Published: (2024)
by: Reuel, Anka, et al.
Published: (2024)
Enhanced Importance Sampling through Latent Space Exploration in Normalizing Flows
by: Kruse, Liam A., et al.
Published: (2025)
by: Kruse, Liam A., et al.
Published: (2025)
Beyond Discriminant Patterns: On the Robustness of Decision Rule Ensembles
by: Du, Xin, et al.
Published: (2021)
by: Du, Xin, et al.
Published: (2021)
SCOUT: A Lightweight Framework for Scenario Coverage Assessment in Autonomous Driving
by: Yildiz, Anil, et al.
Published: (2025)
by: Yildiz, Anil, et al.
Published: (2025)
Conditional Deep Generative Models for Belief State Planning
by: Bigeard, Antoine, et al.
Published: (2025)
by: Bigeard, Antoine, et al.
Published: (2025)
Self-Captioning Multimodal Interaction Tuning: Amplifying Exploitable Redundancies for Robust Vision Language Models
by: Ryan, Yuriel, et al.
Published: (2026)
by: Ryan, Yuriel, et al.
Published: (2026)
Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation
by: Li, Jiachen, et al.
Published: (2024)
by: Li, Jiachen, et al.
Published: (2024)
Reinforced sequential Monte Carlo for amortised sampling
by: Choi, Sanghyeok, et al.
Published: (2025)
by: Choi, Sanghyeok, et al.
Published: (2025)
Inferring Traffic Models in Terminal Airspace from Flight Tracks and Procedures
by: Jung, Soyeon, et al.
Published: (2023)
by: Jung, Soyeon, et al.
Published: (2023)
The Synergy Between Optimal Transport Theory and Multi-Agent Reinforcement Learning
by: Baheri, Ali, et al.
Published: (2024)
by: Baheri, Ali, et al.
Published: (2024)
Valid Inference with Imperfect Synthetic Data
by: Byun, Yewon, et al.
Published: (2025)
by: Byun, Yewon, et al.
Published: (2025)
Representation Stability in a Minimal Continual Learning Agent
by: Subramanian, Vishnu
Published: (2026)
by: Subramanian, Vishnu
Published: (2026)
An Iterative Bayesian Approach for System Identification based on Linear Gaussian Models
by: Tzikas, Alexandros E., et al.
Published: (2025)
by: Tzikas, Alexandros E., et al.
Published: (2025)
On Technique Identification and Threat-Actor Attribution using LLMs and Embedding Models
by: Guru, Kyla, et al.
Published: (2025)
by: Guru, Kyla, et al.
Published: (2025)
Learning to Defer for Causal Discovery with Imperfect Experts
by: Clivio, Oscar, et al.
Published: (2025)
by: Clivio, Oscar, et al.
Published: (2025)
Memory Allocation in Resource-Constrained Reinforcement Learning
by: Tamborski, Massimiliano, et al.
Published: (2025)
by: Tamborski, Massimiliano, et al.
Published: (2025)
TolerantECG: A Foundation Model for Imperfect Electrocardiogram
by: Nguyen, Huynh Dang, et al.
Published: (2025)
by: Nguyen, Huynh Dang, et al.
Published: (2025)
Can RLHF be More Efficient with Imperfect Reward Models? A Policy Coverage Perspective
by: Huang, Jiawei, et al.
Published: (2025)
by: Huang, Jiawei, et al.
Published: (2025)
An Imperfect Verifier is Good Enough: Learning with Noisy Rewards
by: Plesner, Andreas, et al.
Published: (2026)
by: Plesner, Andreas, et al.
Published: (2026)
Robust Causal Discovery under Imperfect Structural Constraints
by: Wang, Zidong, et al.
Published: (2025)
by: Wang, Zidong, et al.
Published: (2025)
Assistax: A Hardware-Accelerated Reinforcement Learning Benchmark for Assistive Robotics
by: Hinckeldey, Leonard, et al.
Published: (2025)
by: Hinckeldey, Leonard, et al.
Published: (2025)
Robust Planning for Autonomous Vehicles with Diffusion-Based Failure Samplers
by: Wang, Juanran, et al.
Published: (2025)
by: Wang, Juanran, et al.
Published: (2025)
Hierarchical Apprenticeship Learning from Imperfect Demonstrations with Evolving Rewards
by: Islam, Md Mirajul, et al.
Published: (2026)
by: Islam, Md Mirajul, et al.
Published: (2026)
Diffusion Models for Safety Validation of Autonomous Driving Systems
by: Wang, Juanran, et al.
Published: (2025)
by: Wang, Juanran, et al.
Published: (2025)
Optimizing Falsification for Learning-Based Control Systems: A Multi-Fidelity Bayesian Approach
by: Shahrooei, Zahra, et al.
Published: (2024)
by: Shahrooei, Zahra, et al.
Published: (2024)
Optimal Ground Station Selection for Low-Earth Orbiting Satellites
by: Eddy, Duncan, et al.
Published: (2024)
by: Eddy, Duncan, et al.
Published: (2024)
Scene Informer: Anchor-based Occlusion Inference and Trajectory Prediction in Partially Observable Environments
by: Lange, Bernard, et al.
Published: (2023)
by: Lange, Bernard, et al.
Published: (2023)
Cooperative Bayesian Optimization for Imperfect Agents
by: Khoshvishkaie, Ali, et al.
Published: (2024)
by: Khoshvishkaie, Ali, et al.
Published: (2024)
Self-Play Reinforcement Learning under Imperfect Information in Big 2
by: Patwa, Aalok
Published: (2026)
by: Patwa, Aalok
Published: (2026)
Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers
by: Cai, Xin-Qiang, et al.
Published: (2025)
by: Cai, Xin-Qiang, et al.
Published: (2025)
Causal Discovery from Heteroscedastic Stochastic Dynamical Systems under Imperfect Physical Models
by: Chen, Jianhong, et al.
Published: (2026)
by: Chen, Jianhong, et al.
Published: (2026)
A Semi-Decentralized Approach to Multiagent Control
by: Al-Husseini, Mahdi, et al.
Published: (2026)
by: Al-Husseini, Mahdi, et al.
Published: (2026)
Similar Items
-
Repairing Reward Functions with Feedback to Mitigate Reward Hacking
by: Hatgis-Kessell, Stephane, et al.
Published: (2025) -
Scaling Recurrent Neural Networks to a Billion Parameters with Zero-Order Optimization
by: Chaubard, Francois, et al.
Published: (2025) -
Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering
by: Chaubard, Francois, et al.
Published: (2024) -
Failure Probability Estimation for Black-Box Autonomous Systems using State-Dependent Importance Sampling Proposals
by: Delecki, Harrison, et al.
Published: (2024) -
Graph Q-Learning for Combinatorial Optimization
by: Dax, Victoria M., et al.
Published: (2024)