Guardado en:
Detalles Bibliográficos
Autores principales: Hossain, Ha Meem, Nath, Pritam, Mahi, Mahitun Nesa, Uddin, Imtiaz, Eiste, Ishrat Jahan, Ratul, Syed Nasibur Rahman, Mozumdar, Md Naim Uddin, Saad, Asif Mohammed, Hossain, MD Tamim
Formato: Preprint
Publicado: 2025
Materias:
Acceso en línea:https://arxiv.org/abs/2509.05652
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866909839143206912
author Hossain, Ha Meem
Nath, Pritam
Mahi, Mahitun Nesa
Uddin, Imtiaz
Eiste, Ishrat Jahan
Ratul, Syed Nasibur Rahman
Mozumdar, Md Naim Uddin
Saad, Asif Mohammed
Hossain, MD Tamim
author_facet Hossain, Ha Meem
Nath, Pritam
Mahi, Mahitun Nesa
Uddin, Imtiaz
Eiste, Ishrat Jahan
Ratul, Syed Nasibur Rahman
Mozumdar, Md Naim Uddin
Saad, Asif Mohammed
Hossain, MD Tamim
contents Vehicle detection systems trained on Non-Bangladeshi datasets struggle to accurately identify local vehicle types in Bangladesh's unique road environments, creating critical gaps in autonomous driving technology for developing regions. This study evaluates six YOLO model variants on a custom dataset featuring 29 distinct vehicle classes, including region-specific vehicles such as ``Desi Nosimon'', ``Leguna'', ``Battery Rickshaw'', and ``CNG''. The dataset comprises high-resolution images (1920x1080) captured across various Bangladeshi roads using mobile phone cameras and manually annotated using LabelImg with YOLO format bounding boxes. Performance evaluation revealed YOLOv11x as the top performer, achieving 63.7\% mAP@0.5, 43.8\% mAP@0.5:0.95, 61.4\% recall, and 61.6\% F1-score, though requiring 45.8 milliseconds per image for inference. Medium variants (YOLOv8m, YOLOv11m) struck an optimal balance, delivering robust detection performance with mAP@0.5 values of 62.5\% and 61.8\% respectively, while maintaining moderate inference times around 14-15 milliseconds. The study identified significant detection challenges for rare vehicle classes, with Construction Vehicles and Desi Nosimons showing near-zero accuracy due to dataset imbalances and insufficient training samples. Confusion matrices revealed frequent misclassifications between visually similar vehicles, particularly Mini Trucks versus Mini Covered Vans. This research provides a foundation for developing robust object detection systems specifically adapted to Bangladesh traffic conditions, addressing critical needs in autonomous vehicle technology advancement for developing regions where conventional generic-trained models fail to perform adequately.
format Preprint
id arxiv_https___arxiv_org_abs_2509_05652
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Evaluating YOLO Architectures: Implications for Real-Time Vehicle Detection in Urban Environments of Bangladesh
Hossain, Ha Meem
Nath, Pritam
Mahi, Mahitun Nesa
Uddin, Imtiaz
Eiste, Ishrat Jahan
Ratul, Syed Nasibur Rahman
Mozumdar, Md Naim Uddin
Saad, Asif Mohammed
Hossain, MD Tamim
Computer Vision and Pattern Recognition
Vehicle detection systems trained on Non-Bangladeshi datasets struggle to accurately identify local vehicle types in Bangladesh's unique road environments, creating critical gaps in autonomous driving technology for developing regions. This study evaluates six YOLO model variants on a custom dataset featuring 29 distinct vehicle classes, including region-specific vehicles such as ``Desi Nosimon'', ``Leguna'', ``Battery Rickshaw'', and ``CNG''. The dataset comprises high-resolution images (1920x1080) captured across various Bangladeshi roads using mobile phone cameras and manually annotated using LabelImg with YOLO format bounding boxes. Performance evaluation revealed YOLOv11x as the top performer, achieving 63.7\% mAP@0.5, 43.8\% mAP@0.5:0.95, 61.4\% recall, and 61.6\% F1-score, though requiring 45.8 milliseconds per image for inference. Medium variants (YOLOv8m, YOLOv11m) struck an optimal balance, delivering robust detection performance with mAP@0.5 values of 62.5\% and 61.8\% respectively, while maintaining moderate inference times around 14-15 milliseconds. The study identified significant detection challenges for rare vehicle classes, with Construction Vehicles and Desi Nosimons showing near-zero accuracy due to dataset imbalances and insufficient training samples. Confusion matrices revealed frequent misclassifications between visually similar vehicles, particularly Mini Trucks versus Mini Covered Vans. This research provides a foundation for developing robust object detection systems specifically adapted to Bangladesh traffic conditions, addressing critical needs in autonomous vehicle technology advancement for developing regions where conventional generic-trained models fail to perform adequately.
title Evaluating YOLO Architectures: Implications for Real-Time Vehicle Detection in Urban Environments of Bangladesh
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2509.05652