Saved in:
Bibliographic Details
Main Authors: Wang, Yingjie, Wang, Zhixing, Zheng, Le, Liu, Tianxiao, Li, Roujing, Hu, Xueyao
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.09796
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913988570251264
author Wang, Yingjie
Wang, Zhixing
Zheng, Le
Liu, Tianxiao
Li, Roujing
Hu, Xueyao
author_facet Wang, Yingjie
Wang, Zhixing
Zheng, Le
Liu, Tianxiao
Li, Roujing
Hu, Xueyao
contents Multi-object tracking (MOT) in human-dominant scenarios, which involves continuously tracking multiple people within video sequences, remains a significant challenge in computer vision due to targets' complex motion and severe occlusions. Conventional tracking-by-detection methods are fundamentally limited by their reliance on Kalman filter (KF) and rigid Intersection over Union (IoU)-based association. The motion model in KF often mismatches real-world object dynamics, causing filtering errors, while rigid association struggles under occlusions, leading to identity switches or target loss. To address these issues, we propose MeMoSORT, a simple, online, and real-time MOT tracker with two key innovations. First, the Memory-assisted Kalman filter (MeKF) uses memory-augmented neural networks to compensate for mismatches between assumed and actual object motion. Second, the Motion-adaptive IoU (Mo-IoU) adaptively expands the matching space and incorporates height similarity to reduce the influence of detection errors and association failures, while remaining lightweight. Experiments on DanceTrack and SportsMOT show that MeMoSORT achieves state-of-the-art performance, with HOTA scores of 67.9\% and 82.1\%, respectively.
format Preprint
id arxiv_https___arxiv_org_abs_2508_09796
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle MeMoSORT: Memory-Assisted Filtering and Motion-Adaptive Association Metric for Multi-Person Tracking
Wang, Yingjie
Wang, Zhixing
Zheng, Le
Liu, Tianxiao
Li, Roujing
Hu, Xueyao
Computer Vision and Pattern Recognition
Multi-object tracking (MOT) in human-dominant scenarios, which involves continuously tracking multiple people within video sequences, remains a significant challenge in computer vision due to targets' complex motion and severe occlusions. Conventional tracking-by-detection methods are fundamentally limited by their reliance on Kalman filter (KF) and rigid Intersection over Union (IoU)-based association. The motion model in KF often mismatches real-world object dynamics, causing filtering errors, while rigid association struggles under occlusions, leading to identity switches or target loss. To address these issues, we propose MeMoSORT, a simple, online, and real-time MOT tracker with two key innovations. First, the Memory-assisted Kalman filter (MeKF) uses memory-augmented neural networks to compensate for mismatches between assumed and actual object motion. Second, the Motion-adaptive IoU (Mo-IoU) adaptively expands the matching space and incorporates height similarity to reduce the influence of detection errors and association failures, while remaining lightweight. Experiments on DanceTrack and SportsMOT show that MeMoSORT achieves state-of-the-art performance, with HOTA scores of 67.9\% and 82.1\%, respectively.
title MeMoSORT: Memory-Assisted Filtering and Motion-Adaptive Association Metric for Multi-Person Tracking
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2508.09796