Staff View: Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets :: Library Catalog

Bibliographic Details
Main Authors:	Cortés, Andoni; Rodríguez, Clemente; Velez, Gorka; Barandiarán, Javier; Nieto, Marcos
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2410.22748

_version_	1866913567939231744
author	Cortés, Andoni; Rodríguez, Clemente; Velez, Gorka; Barandiarán, Javier; Nieto, Marcos
author_facet	Cortés, Andoni; Rodríguez, Clemente; Velez, Gorka; Barandiarán, Javier; Nieto, Marcos
contents	A major challenge of deep learning (DL) is the need to collect huge amounts of training data. Often, the lack of a sufficiently large dataset discourages the use of DL in certain applications. Typically, acquiring the required amounts of data costs considerable time, material, and effort. To mitigate this problem, combining synthetic images with real data is a popular approach, widely adopted in the scientific community to train various detectors effectively. In this study, we examined the potential of synthetic data-based training in the field of intelligent transportation systems. Our focus is on camera-based traffic sign recognition applications for advanced driver assistance systems and autonomous driving. The proposed augmentation pipeline for synthetic datasets includes novel augmentation processes such as structured shadows and Gaussian specular highlights. A well-known DL model was trained with different datasets to compare the performance of models trained on synthetic and real images. Additionally, a new, detailed method to objectively compare these models is proposed. Synthetic images are generated using a semi-supervised error-guided method, which is also described. Our experiments showed that a synthetic image-based approach outperforms real image-based training in most cases when applied to cross-domain test datasets (+10% precision for the GTSRB dataset); consequently, the generalization of the model is improved while the cost of acquiring images is reduced.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_22748
institution	arXiv
publishDate	2024
record_format	arxiv
title	Analysis of Classifier Training on Synthetic Data for Cross-Domain Datasets
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2410.22748
