Saved in:
Bibliographic Details
Main Authors: Chakrabarty, Yashashwee, Sarangi, Smruti Ranjan
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.15067
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929429353070592
author Chakrabarty, Yashashwee
Sarangi, Smruti Ranjan
author_facet Chakrabarty, Yashashwee
Sarangi, Smruti Ranjan
contents Autonomous mobile robots like self-flying drones and industrial robots heavily depend on depth images to perform tasks such as 3D reconstruction and visual SLAM. However, the presence of inaccuracies in these depth images can greatly hinder the effectiveness of these applications, resulting in sub-optimal results. Depth images produced by commercially available cameras frequently exhibit noise, which manifests as flickering pixels and erroneous patches. ML-based methods to rectify these images are unsuitable for edge devices that have very limited computational resources. Non-ML methods are much faster but have limited accuracy, especially for correcting errors that are a result of occlusion and camera movement. We propose a scheme called VoxDepth that is fast, accurate, and runs very well on edge devices. It relies on a host of novel techniques: 3D point cloud construction and fusion, and using it to create a template that can fix erroneous depth images. VoxDepth shows superior results on both synthetic and real-world datasets. We demonstrate a 31% improvement in quality as compared to state-of-the-art methods on real-world depth datasets, while maintaining a competitive framerate of 27 FPS (frames per second).
format Preprint
id arxiv_https___arxiv_org_abs_2407_15067
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle VoxDepth: Rectification of Depth Images on Edge Devices
Chakrabarty, Yashashwee
Sarangi, Smruti Ranjan
Computer Vision and Pattern Recognition
Autonomous mobile robots like self-flying drones and industrial robots heavily depend on depth images to perform tasks such as 3D reconstruction and visual SLAM. However, the presence of inaccuracies in these depth images can greatly hinder the effectiveness of these applications, resulting in sub-optimal results. Depth images produced by commercially available cameras frequently exhibit noise, which manifests as flickering pixels and erroneous patches. ML-based methods to rectify these images are unsuitable for edge devices that have very limited computational resources. Non-ML methods are much faster but have limited accuracy, especially for correcting errors that are a result of occlusion and camera movement. We propose a scheme called VoxDepth that is fast, accurate, and runs very well on edge devices. It relies on a host of novel techniques: 3D point cloud construction and fusion, and using it to create a template that can fix erroneous depth images. VoxDepth shows superior results on both synthetic and real-world datasets. We demonstrate a 31% improvement in quality as compared to state-of-the-art methods on real-world depth datasets, while maintaining a competitive framerate of 27 FPS (frames per second).
title VoxDepth: Rectification of Depth Images on Edge Devices
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2407.15067