Saved in:
Bibliographic Details
Main Authors: Karramreddy, Venkat, Mitchell, Liam
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.13402
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909321895346176
author Karramreddy, Venkat
Mitchell, Liam
author_facet Karramreddy, Venkat
Mitchell, Liam
contents This article presents an innovative study in exploring, evaluating, and implementing deep learning architectures for the calibration of multi-modal sensor systems. The focus behind this is to leverage the use of sensor fusion to achieve dynamic, real-time alignment between 3D LiDAR and 2D Camera sensors. static calibration methods are tedious and time-consuming, which is why we propose utilizing Conventional Neural Networks (CNN) coupled with geometrically informed learning to solve this issue. We leverage the foundational principles of Extrinsic LiDAR-Camera Calibration tools such as RegNet, CalibNet, and LCCNet by exploring open-source models that are available online and comparing our results with their corresponding research papers. Requirements for extracting these visual and measurable outputs involved tweaking source code, fine-tuning, training, validation, and testing for each of these frameworks for equal comparisons. This approach aims to investigate which of these advanced networks produces the most accurate and consistent predictions. Through a series of experiments, we reveal some of their shortcomings and areas for potential improvements along the way. We find that LCCNet yields the best results out of all the models that we validated.
format Preprint
id arxiv_https___arxiv_org_abs_2409_13402
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Validation & Exploration of Multimodal Deep-Learning Camera-Lidar Calibration models
Karramreddy, Venkat
Mitchell, Liam
Computer Vision and Pattern Recognition
Artificial Intelligence
Robotics
This article presents an innovative study in exploring, evaluating, and implementing deep learning architectures for the calibration of multi-modal sensor systems. The focus behind this is to leverage the use of sensor fusion to achieve dynamic, real-time alignment between 3D LiDAR and 2D Camera sensors. static calibration methods are tedious and time-consuming, which is why we propose utilizing Conventional Neural Networks (CNN) coupled with geometrically informed learning to solve this issue. We leverage the foundational principles of Extrinsic LiDAR-Camera Calibration tools such as RegNet, CalibNet, and LCCNet by exploring open-source models that are available online and comparing our results with their corresponding research papers. Requirements for extracting these visual and measurable outputs involved tweaking source code, fine-tuning, training, validation, and testing for each of these frameworks for equal comparisons. This approach aims to investigate which of these advanced networks produces the most accurate and consistent predictions. Through a series of experiments, we reveal some of their shortcomings and areas for potential improvements along the way. We find that LCCNet yields the best results out of all the models that we validated.
title Validation & Exploration of Multimodal Deep-Learning Camera-Lidar Calibration models
topic Computer Vision and Pattern Recognition
Artificial Intelligence
Robotics
url https://arxiv.org/abs/2409.13402