Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.07706 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866916455410302976 |
|---|---|
| author | Rosbach, Sascha Leupold, Stefan M. Großjohann, Simon Roth, Stefan |
| author_facet | Rosbach, Sascha Leupold, Stefan M. Großjohann, Simon Roth, Stefan |
| contents | Automated vehicles operating in urban environments have to reliably interact with other traffic participants. Planning algorithms often utilize separate prediction modules forecasting probabilistic, multi-modal, and interactive behaviors of objects. Designing prediction and planning as two separate modules introduces significant challenges, particularly due to the interdependence of these modules. This work proposes a deep learning methodology to combine prediction and planning. A conditional GAN with the U-Net architecture is trained to predict two high-resolution image sequences. The sequences represent explicit motion predictions, mainly used to train context understanding, and pixel state values suitable for planning encoding kinematic reachability, object dynamics, safety, and driving comfort. The model can be trained offline on target images rendered by a sampling-based model-predictive planner, leveraging real-world driving data. Our results demonstrate intuitive behavior in complex situations, such as lane changes amidst conflicting objectives. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2310_07706 |
| institution | arXiv |
| publishDate | 2023 |
| record_format | arxiv |
| spellingShingle | Pixel State Value Network for Combined Prediction and Planning in Interactive Environments Rosbach, Sascha Leupold, Stefan M. Großjohann, Simon Roth, Stefan Robotics Artificial Intelligence Automated vehicles operating in urban environments have to reliably interact with other traffic participants. Planning algorithms often utilize separate prediction modules forecasting probabilistic, multi-modal, and interactive behaviors of objects. Designing prediction and planning as two separate modules introduces significant challenges, particularly due to the interdependence of these modules. This work proposes a deep learning methodology to combine prediction and planning. A conditional GAN with the U-Net architecture is trained to predict two high-resolution image sequences. The sequences represent explicit motion predictions, mainly used to train context understanding, and pixel state values suitable for planning encoding kinematic reachability, object dynamics, safety, and driving comfort. The model can be trained offline on target images rendered by a sampling-based model-predictive planner, leveraging real-world driving data. Our results demonstrate intuitive behavior in complex situations, such as lane changes amidst conflicting objectives. |
| title | Pixel State Value Network for Combined Prediction and Planning in Interactive Environments |
| topic | Robotics Artificial Intelligence |
| url | https://arxiv.org/abs/2310.07706 |