Saved in:
Bibliographic Details
Main Authors: Zheng, Xu, Liu, Yexin, Lu, Yunfan, Hua, Tongyan, Pan, Tianbo, Zhang, Weiming, Tao, Dacheng, Wang, Lin
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2302.08890
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910405571379200
author Zheng, Xu
Liu, Yexin
Lu, Yunfan
Hua, Tongyan
Pan, Tianbo
Zhang, Weiming
Tao, Dacheng
Wang, Lin
author_facet Zheng, Xu
Liu, Yexin
Lu, Yunfan
Hua, Tongyan
Pan, Tianbo
Zhang, Weiming
Tao, Dacheng
Wang, Lin
contents Event cameras are bio-inspired sensors that capture the per-pixel intensity changes asynchronously and produce event streams encoding the time, pixel position, and polarity (sign) of the intensity changes. Event cameras possess a myriad of advantages over canonical frame-based cameras, such as high temporal resolution, high dynamic range, low latency, etc. Being capable of capturing information in challenging visual conditions, event cameras have the potential to overcome the limitations of frame-based cameras in the computer vision and robotics community. In very recent years, deep learning (DL) has been brought to this emerging field and inspired active research endeavors in mining its potential. However, there is still a lack of taxonomies in DL techniques for event-based vision. We first scrutinize the typical event representations with quality enhancement methods as they play a pivotal role as inputs to the DL models. We then provide a comprehensive survey of existing DL-based methods by structurally grouping them into two major categories: 1) image/video reconstruction and restoration; 2) event-based scene understanding and 3D vision. We conduct benchmark experiments for the existing methods in some representative research directions, i.e., image reconstruction, deblurring, and object recognition, to identify some critical insights and problems. Finally, we have discussions regarding the challenges and provide new perspectives for inspiring more research studies.
format Preprint
id arxiv_https___arxiv_org_abs_2302_08890
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks
Zheng, Xu
Liu, Yexin
Lu, Yunfan
Hua, Tongyan
Pan, Tianbo
Zhang, Weiming
Tao, Dacheng
Wang, Lin
Computer Vision and Pattern Recognition
Event cameras are bio-inspired sensors that capture the per-pixel intensity changes asynchronously and produce event streams encoding the time, pixel position, and polarity (sign) of the intensity changes. Event cameras possess a myriad of advantages over canonical frame-based cameras, such as high temporal resolution, high dynamic range, low latency, etc. Being capable of capturing information in challenging visual conditions, event cameras have the potential to overcome the limitations of frame-based cameras in the computer vision and robotics community. In very recent years, deep learning (DL) has been brought to this emerging field and inspired active research endeavors in mining its potential. However, there is still a lack of taxonomies in DL techniques for event-based vision. We first scrutinize the typical event representations with quality enhancement methods as they play a pivotal role as inputs to the DL models. We then provide a comprehensive survey of existing DL-based methods by structurally grouping them into two major categories: 1) image/video reconstruction and restoration; 2) event-based scene understanding and 3D vision. We conduct benchmark experiments for the existing methods in some representative research directions, i.e., image reconstruction, deblurring, and object recognition, to identify some critical insights and problems. Finally, we have discussions regarding the challenges and provide new perspectives for inspiring more research studies.
title Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2302.08890