Saved in:
Bibliographic Details
Main Authors: Sheng, Shili, Yu, Pian, Parker, David, Kwiatkowska, Marta, Feng, Lu
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.15557
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908043514478592
author Sheng, Shili
Yu, Pian
Parker, David
Kwiatkowska, Marta
Feng, Lu
author_facet Sheng, Shili
Yu, Pian
Parker, David
Kwiatkowska, Marta
Feng, Lu
contents Online planning for partially observable Markov decision processes (POMDPs) provides efficient techniques for robot decision-making under uncertainty. However, existing methods fall short of preventing safety violations in dynamic environments. This work presents a novel safe POMDP online planning approach that maximizes expected returns while providing probabilistic safety guarantees amidst environments populated by multiple dynamic agents. Our approach utilizes data-driven trajectory prediction models of dynamic agents and applies Adaptive Conformal Prediction (ACP) to quantify the uncertainties in these predictions. Leveraging the obtained ACP-based trajectory predictions, our approach constructs safety shields on-the-fly to prevent unsafe actions within POMDP online planning. Through experimental evaluation in various dynamic environments using real-world pedestrian trajectory data, the proposed approach has been shown to effectively maintain probabilistic safety guarantees while accommodating up to hundreds of dynamic agents.
format Preprint
id arxiv_https___arxiv_org_abs_2404_15557
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Safe POMDP Online Planning among Dynamic Agents via Adaptive Conformal Prediction
Sheng, Shili
Yu, Pian
Parker, David
Kwiatkowska, Marta
Feng, Lu
Robotics
Online planning for partially observable Markov decision processes (POMDPs) provides efficient techniques for robot decision-making under uncertainty. However, existing methods fall short of preventing safety violations in dynamic environments. This work presents a novel safe POMDP online planning approach that maximizes expected returns while providing probabilistic safety guarantees amidst environments populated by multiple dynamic agents. Our approach utilizes data-driven trajectory prediction models of dynamic agents and applies Adaptive Conformal Prediction (ACP) to quantify the uncertainties in these predictions. Leveraging the obtained ACP-based trajectory predictions, our approach constructs safety shields on-the-fly to prevent unsafe actions within POMDP online planning. Through experimental evaluation in various dynamic environments using real-world pedestrian trajectory data, the proposed approach has been shown to effectively maintain probabilistic safety guarantees while accommodating up to hundreds of dynamic agents.
title Safe POMDP Online Planning among Dynamic Agents via Adaptive Conformal Prediction
topic Robotics
url https://arxiv.org/abs/2404.15557