Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Charoenpitaks, Korawat, Nguyen, Van-Quang, Suganuma, Masanori, Takahashi, Masahiro, Niihara, Ryoma, Okatani, Takayuki
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2310.04671
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929403014938624
author	Charoenpitaks, Korawat Nguyen, Van-Quang Suganuma, Masanori Takahashi, Masahiro Niihara, Ryoma Okatani, Takayuki
author_facet	Charoenpitaks, Korawat Nguyen, Van-Quang Suganuma, Masanori Takahashi, Masahiro Niihara, Ryoma Okatani, Takayuki
contents	This paper addresses the problem of predicting hazards that drivers may encounter while driving a car. We formulate it as a task of anticipating impending accidents using a single input image captured by car dashcams. Unlike existing approaches to driving hazard prediction that rely on computational simulations or anomaly detection from videos, this study focuses on high-level inference from static images. The problem needs predicting and reasoning about future events based on uncertain observations, which falls under visual abductive reasoning. To enable research in this understudied area, a new dataset named the DHPR (Driving Hazard Prediction and Reasoning) dataset is created. The dataset consists of 15K dashcam images of street scenes, and each image is associated with a tuple containing car speed, a hypothesized hazard description, and visual entities present in the scene. These are annotated by human annotators, who identify risky scenes and provide descriptions of potential accidents that could occur a few seconds later. We present several baseline methods and evaluate their performance on our dataset, identifying remaining issues and discussing future directions. This study contributes to the field by introducing a novel problem formulation and dataset, enabling researchers to explore the potential of multi-modal AI for driving hazard prediction.
format	Preprint
id	arxiv_https___arxiv_org_abs_2310_04671
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Exploring the Potential of Multi-Modal AI for Driving Hazard Prediction Charoenpitaks, Korawat Nguyen, Van-Quang Suganuma, Masanori Takahashi, Masahiro Niihara, Ryoma Okatani, Takayuki Computer Vision and Pattern Recognition This paper addresses the problem of predicting hazards that drivers may encounter while driving a car. We formulate it as a task of anticipating impending accidents using a single input image captured by car dashcams. Unlike existing approaches to driving hazard prediction that rely on computational simulations or anomaly detection from videos, this study focuses on high-level inference from static images. The problem needs predicting and reasoning about future events based on uncertain observations, which falls under visual abductive reasoning. To enable research in this understudied area, a new dataset named the DHPR (Driving Hazard Prediction and Reasoning) dataset is created. The dataset consists of 15K dashcam images of street scenes, and each image is associated with a tuple containing car speed, a hypothesized hazard description, and visual entities present in the scene. These are annotated by human annotators, who identify risky scenes and provide descriptions of potential accidents that could occur a few seconds later. We present several baseline methods and evaluate their performance on our dataset, identifying remaining issues and discussing future directions. This study contributes to the field by introducing a novel problem formulation and dataset, enabling researchers to explore the potential of multi-modal AI for driving hazard prediction.
title	Exploring the Potential of Multi-Modal AI for Driving Hazard Prediction
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2310.04671

Similar Items