Saved in:
Bibliographic Details
Main Authors: Liu, Ruiliang, Li, Tina Dongxu, Migdal, Joshua, Ruch, Fernando, Meszaros, Kenneth, Dardik, Moses Trevor
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2606.00321
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913174901489664
author Liu, Ruiliang
Li, Tina Dongxu
Migdal, Joshua
Ruch, Fernando
Meszaros, Kenneth
Dardik, Moses Trevor
author_facet Liu, Ruiliang
Li, Tina Dongxu
Migdal, Joshua
Ruch, Fernando
Meszaros, Kenneth
Dardik, Moses Trevor
contents In fulfillment centers, diverse objects move continuously from inbound to outbound operations and can become jammed due to excessive conveyor friction, incorrect orientation, or mechanical failures. Traditional jam detection approaches rely on object detection models to identify objects, followed by tracking algorithms (such as IoU overlap and Kalman filtering) to monitor motion over time. This pipeline requires thousands of manual annotations, consuming approximately two weeks of effort, and is limited to annotated object classes. We present a training-free, object-agnostic jam detection method that eliminates the need for labeled data. Our approach uniformly samples reference points within the monitoring region when no objects are present. As objects occlude these points, we detect motion. When a sufficient fraction remains occluded beyond a temporal threshold, we classify the event as a jam. Unlike conventional point tracking--which treats occlusion as a failure case--our approach repurposes occlusion as a detection signal, monitoring whether reference points remain persistently occluded rather than tracking where they move. Our experimental evaluation on 1,069 videos demonstrates that AllTracker achieves 100.00% precision and 93.33% F1 score, significantly outperforming classical sparse tracking methods while maintaining training-free deployment. This approach offers three key advantages: (1) no training data or manual annotations, (2) object-agnostic generalization to arbitrary object types, and (3) significantly reduced development time.
format Preprint
id arxiv_https___arxiv_org_abs_2606_00321
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Training-Free Object-Agnostic Jam Detection in Fulfillment Centers
Liu, Ruiliang
Li, Tina Dongxu
Migdal, Joshua
Ruch, Fernando
Meszaros, Kenneth
Dardik, Moses Trevor
Computer Vision and Pattern Recognition
In fulfillment centers, diverse objects move continuously from inbound to outbound operations and can become jammed due to excessive conveyor friction, incorrect orientation, or mechanical failures. Traditional jam detection approaches rely on object detection models to identify objects, followed by tracking algorithms (such as IoU overlap and Kalman filtering) to monitor motion over time. This pipeline requires thousands of manual annotations, consuming approximately two weeks of effort, and is limited to annotated object classes. We present a training-free, object-agnostic jam detection method that eliminates the need for labeled data. Our approach uniformly samples reference points within the monitoring region when no objects are present. As objects occlude these points, we detect motion. When a sufficient fraction remains occluded beyond a temporal threshold, we classify the event as a jam. Unlike conventional point tracking--which treats occlusion as a failure case--our approach repurposes occlusion as a detection signal, monitoring whether reference points remain persistently occluded rather than tracking where they move. Our experimental evaluation on 1,069 videos demonstrates that AllTracker achieves 100.00% precision and 93.33% F1 score, significantly outperforming classical sparse tracking methods while maintaining training-free deployment. This approach offers three key advantages: (1) no training data or manual annotations, (2) object-agnostic generalization to arbitrary object types, and (3) significantly reduced development time.
title Training-Free Object-Agnostic Jam Detection in Fulfillment Centers
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2606.00321