Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Sutar, Vamshika, Maheshwari, Mahek, Mittal, Archak
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2511.06295
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866911255945543680
author Sutar, Vamshika
Maheshwari, Mahek
Mittal, Archak
author_facet Sutar, Vamshika
Maheshwari, Mahek
Mittal, Archak
contents The automation of material handling in warehouses increasingly relies on robust, low cost perception systems for forklifts and Automated Guided Vehicles (AGVs). This work presents a vision based framework for pallet and pallet hole detection and mapping using a single standard camera. We utilized YOLOv8 and YOLOv11 architectures, enhanced through Optuna driven hyperparameter optimization and spatial post processing. An innovative pallet hole mapping module converts the detections into actionable spatial representations, enabling accurate pallet and pallet hole association for forklift operation. Experiments on a custom dataset augmented with real warehouse imagery show that YOLOv8 achieves high pallet and pallet hole detection accuracy, while YOLOv11, particularly under optimized configurations, offers superior precision and stable convergence. The results demonstrate the feasibility of a cost effective, retrofittable visual perception module for forklifts. This study proposes a scalable approach to advancing warehouse automation, promoting safer, economical, and intelligent logistics operations.
format Preprint
id arxiv_https___arxiv_org_abs_2511_06295
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Learning-Based Vision Systems for Semi-Autonomous Forklift Operation in Industrial Warehouse Environments
Sutar, Vamshika
Maheshwari, Mahek
Mittal, Archak
Computer Vision and Pattern Recognition
The automation of material handling in warehouses increasingly relies on robust, low cost perception systems for forklifts and Automated Guided Vehicles (AGVs). This work presents a vision based framework for pallet and pallet hole detection and mapping using a single standard camera. We utilized YOLOv8 and YOLOv11 architectures, enhanced through Optuna driven hyperparameter optimization and spatial post processing. An innovative pallet hole mapping module converts the detections into actionable spatial representations, enabling accurate pallet and pallet hole association for forklift operation. Experiments on a custom dataset augmented with real warehouse imagery show that YOLOv8 achieves high pallet and pallet hole detection accuracy, while YOLOv11, particularly under optimized configurations, offers superior precision and stable convergence. The results demonstrate the feasibility of a cost effective, retrofittable visual perception module for forklifts. This study proposes a scalable approach to advancing warehouse automation, promoting safer, economical, and intelligent logistics operations.
title Learning-Based Vision Systems for Semi-Autonomous Forklift Operation in Industrial Warehouse Environments
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2511.06295