Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hidayatullah, Priyanto, Syakrani, Nurjannah, Widhiyasana, Yudi, Sholahuddin, Muhammad Rizqi, Tubagus, Refdinal, Hidayat, Zahri Al Adzani, Ramadhan, Hanri Fajar, Pratama, Dafa Alfarizki, Yasin, Farhan Muhammad
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.04888
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909977350766592
author	Hidayatullah, Priyanto Syakrani, Nurjannah Widhiyasana, Yudi Sholahuddin, Muhammad Rizqi Tubagus, Refdinal Hidayat, Zahri Al Adzani Ramadhan, Hanri Fajar Pratama, Dafa Alfarizki Yasin, Farhan Muhammad
author_facet	Hidayatullah, Priyanto Syakrani, Nurjannah Widhiyasana, Yudi Sholahuddin, Muhammad Rizqi Tubagus, Refdinal Hidayat, Zahri Al Adzani Ramadhan, Hanri Fajar Pratama, Dafa Alfarizki Yasin, Farhan Muhammad
contents	Object detection constitutes the primary task within the domain of computer vision. It is utilized in numerous domains. Nonetheless, object detection continues to encounter the issue of catastrophic forgetting. The model must be retrained whenever new products are introduced, utilizing not only the new products dataset but also the entirety of the previous dataset. The outcome is obvious: increasing model training expenses and significant time consumption. In numerous sectors, particularly retail checkout, the frequent introduction of new products presents a great challenge. This study introduces Zero-Retraining Based Recognition and Object Detection (ZeBROD), a methodology designed to address the issue of catastrophic forgetting by integrating YOLO11n for object localization with DeIT and Proxy Anchor Loss for feature extraction and metric learning. For classification, we utilize cosine similarity between the embedding features of the target product and those in the Qdrant vector database. In a case study conducted in a retail store with 140 products, the experimental results demonstrate that our proposed framework achieves encouraging accuracy, whether for detecting new or existing products. Furthermore, without retraining, the training duration difference is significant. We achieve almost 3 times the training time efficiency compared to classical object detection approaches. This efficiency escalates as additional new products are added to the product database. The average inference time is 580 ms per image containing multiple products, on an edge device, validating the proposed framework's feasibility for practical use.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_04888
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	ZeBROD: Zero-Retraining Based Recognition and Object Detection Framework Hidayatullah, Priyanto Syakrani, Nurjannah Widhiyasana, Yudi Sholahuddin, Muhammad Rizqi Tubagus, Refdinal Hidayat, Zahri Al Adzani Ramadhan, Hanri Fajar Pratama, Dafa Alfarizki Yasin, Farhan Muhammad Computer Vision and Pattern Recognition Object detection constitutes the primary task within the domain of computer vision. It is utilized in numerous domains. Nonetheless, object detection continues to encounter the issue of catastrophic forgetting. The model must be retrained whenever new products are introduced, utilizing not only the new products dataset but also the entirety of the previous dataset. The outcome is obvious: increasing model training expenses and significant time consumption. In numerous sectors, particularly retail checkout, the frequent introduction of new products presents a great challenge. This study introduces Zero-Retraining Based Recognition and Object Detection (ZeBROD), a methodology designed to address the issue of catastrophic forgetting by integrating YOLO11n for object localization with DeIT and Proxy Anchor Loss for feature extraction and metric learning. For classification, we utilize cosine similarity between the embedding features of the target product and those in the Qdrant vector database. In a case study conducted in a retail store with 140 products, the experimental results demonstrate that our proposed framework achieves encouraging accuracy, whether for detecting new or existing products. Furthermore, without retraining, the training duration difference is significant. We achieve almost 3 times the training time efficiency compared to classical object detection approaches. This efficiency escalates as additional new products are added to the product database. The average inference time is 580 ms per image containing multiple products, on an edge device, validating the proposed framework's feasibility for practical use.
title	ZeBROD: Zero-Retraining Based Recognition and Object Detection Framework
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2512.04888

Similar Items