Saved in:
Bibliographic Details
Main Authors: Ribeiro, Victor Nascimento, Hirata, Nina S. T.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2501.04534
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916558141390848
author Ribeiro, Victor Nascimento
Hirata, Nina S. T.
author_facet Ribeiro, Victor Nascimento
Hirata, Nina S. T.
contents Video-based vehicle detection and counting play a critical role in managing transport infrastructure. Traditional image-based counting methods usually involve two main steps: initial detection and subsequent tracking, which are applied to all video frames, leading to a significant increase in computational complexity. To address this issue, this work presents an alternative and more efficient method for vehicle detection and counting. The proposed approach eliminates the need for a tracking step and focuses solely on detecting vehicles in key video frames, thereby increasing its efficiency. To achieve this, we developed a system that combines YOLO, for vehicle detection, with Visual Rhythm, a way to create time-spatial images that allows us to focus on frames that contain useful information. Additionally, this method can be used for counting in any application involving unidirectional moving targets to be detected and identified. Experimental analysis using real videos shows that the proposed method achieves mean counting accuracy around 99.15% over a set of videos, with a processing speed three times faster than tracking based approaches.
format Preprint
id arxiv_https___arxiv_org_abs_2501_04534
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Combining YOLO and Visual Rhythm for Vehicle Counting
Ribeiro, Victor Nascimento
Hirata, Nina S. T.
Computer Vision and Pattern Recognition
Machine Learning
Video-based vehicle detection and counting play a critical role in managing transport infrastructure. Traditional image-based counting methods usually involve two main steps: initial detection and subsequent tracking, which are applied to all video frames, leading to a significant increase in computational complexity. To address this issue, this work presents an alternative and more efficient method for vehicle detection and counting. The proposed approach eliminates the need for a tracking step and focuses solely on detecting vehicles in key video frames, thereby increasing its efficiency. To achieve this, we developed a system that combines YOLO, for vehicle detection, with Visual Rhythm, a way to create time-spatial images that allows us to focus on frames that contain useful information. Additionally, this method can be used for counting in any application involving unidirectional moving targets to be detected and identified. Experimental analysis using real videos shows that the proposed method achieves mean counting accuracy around 99.15% over a set of videos, with a processing speed three times faster than tracking based approaches.
title Combining YOLO and Visual Rhythm for Vehicle Counting
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2501.04534