Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Chaudhry, Faiz Muhammad, Ralli, Jarno, Leudet, Jerome, Sohrab, Fahad, Pakdaman, Farhad, Corbani, Pierre, Gabbouj, Moncef
Formato:	Preprint
Publicado:	2025
Materias:	Computer Vision and Pattern Recognition Graphics Machine Learning
Acceso en línea:	https://arxiv.org/abs/2501.14510
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866914177967194112
author	Chaudhry, Faiz Muhammad Ralli, Jarno Leudet, Jerome Sohrab, Fahad Pakdaman, Farhad Corbani, Pierre Gabbouj, Moncef
author_facet	Chaudhry, Faiz Muhammad Ralli, Jarno Leudet, Jerome Sohrab, Fahad Pakdaman, Farhad Corbani, Pierre Gabbouj, Moncef
contents	This research addresses the challenge of camera calibration and distortion parameter prediction from a single image using deep learning models. The main contributions of this work are: (1) demonstrating that a deep learning model, trained on a mix of real and synthetic images, can accurately predict camera and lens parameters from a single image, and (2) developing a comprehensive synthetic dataset using the AILiveSim simulation platform. This dataset includes variations in focal length and lens distortion parameters, providing a robust foundation for model training and testing. The training process predominantly relied on these synthetic images, complemented by a small subset of real images, to explore how well models trained on synthetic data can perform calibration tasks on real-world images. Traditional calibration methods require multiple images of a calibration object from various orientations, which is often not feasible due to the lack of such images in publicly available datasets. A deep learning network based on the ResNet architecture was trained on this synthetic dataset to predict camera calibration parameters following the Brown-Conrady lens model. The ResNet architecture, adapted for regression tasks, is capable of predicting continuous values essential for accurate camera calibration in applications such as autonomous driving, robotics, and augmented reality. Keywords: Camera calibration, distortion, synthetic data, deep learning, residual networks (ResNet), AILiveSim, horizontal field-of-view, principal point, Brown-Conrady Model.
format	Preprint
id	arxiv_https___arxiv_org_abs_2501_14510
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Deep-BrownConrady: Prediction of Camera Calibration and Distortion Parameters Using Deep Learning and Synthetic Data Chaudhry, Faiz Muhammad Ralli, Jarno Leudet, Jerome Sohrab, Fahad Pakdaman, Farhad Corbani, Pierre Gabbouj, Moncef Computer Vision and Pattern Recognition Graphics Machine Learning This research addresses the challenge of camera calibration and distortion parameter prediction from a single image using deep learning models. The main contributions of this work are: (1) demonstrating that a deep learning model, trained on a mix of real and synthetic images, can accurately predict camera and lens parameters from a single image, and (2) developing a comprehensive synthetic dataset using the AILiveSim simulation platform. This dataset includes variations in focal length and lens distortion parameters, providing a robust foundation for model training and testing. The training process predominantly relied on these synthetic images, complemented by a small subset of real images, to explore how well models trained on synthetic data can perform calibration tasks on real-world images. Traditional calibration methods require multiple images of a calibration object from various orientations, which is often not feasible due to the lack of such images in publicly available datasets. A deep learning network based on the ResNet architecture was trained on this synthetic dataset to predict camera calibration parameters following the Brown-Conrady lens model. The ResNet architecture, adapted for regression tasks, is capable of predicting continuous values essential for accurate camera calibration in applications such as autonomous driving, robotics, and augmented reality. Keywords: Camera calibration, distortion, synthetic data, deep learning, residual networks (ResNet), AILiveSim, horizontal field-of-view, principal point, Brown-Conrady Model.
title	Deep-BrownConrady: Prediction of Camera Calibration and Distortion Parameters Using Deep Learning and Synthetic Data
topic	Computer Vision and Pattern Recognition Graphics Machine Learning
url	https://arxiv.org/abs/2501.14510

Ejemplares similares