Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Ha, Ruiyang, Jiang, Songyi, Li, Bin, Pan, Bikang, Zhu, Yihang, Zhang, Junjie, Zhu, Xiatian, Gong, Shaogang, Wang, Jingya
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2503.17096
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866913753408208896
author Ha, Ruiyang
Jiang, Songyi
Li, Bin
Pan, Bikang
Zhu, Yihang
Zhang, Junjie
Zhu, Xiatian
Gong, Shaogang
Wang, Jingya
author_facet Ha, Ruiyang
Jiang, Songyi
Li, Bin
Pan, Bikang
Zhu, Yihang
Zhang, Junjie
Zhu, Xiatian
Gong, Shaogang
Wang, Jingya
contents Conventional person re-identification (ReID) research is often limited to single-modality sensor data from static cameras, which fails to address the complexities of real-world scenarios where multi-modal signals are increasingly prevalent. For instance, consider an urban ReID system integrating stationary RGB cameras, nighttime infrared sensors, and UAVs equipped with dynamic tracking capabilities. Such systems face significant challenges due to variations in camera perspectives, lighting conditions, and sensor modalities, hindering effective person ReID. To address these challenges, we introduce the MP-ReID benchmark, a novel dataset designed specifically for multi-modality and multi-platform ReID. This benchmark uniquely compiles data from 1,930 identities across diverse modalities, including RGB, infrared, and thermal imaging, captured by both UAVs and ground-based cameras in indoor and outdoor environments. Building on this benchmark, we introduce Uni-Prompt ReID, a framework with specific-designed prompts, tailored for cross-modality and cross-platform scenarios. Our method consistently outperforms state-of-the-art approaches, establishing a robust foundation for future research in complex and dynamic ReID environments. Our dataset are available at:https://mp-reid.github.io/.
format Preprint
id arxiv_https___arxiv_org_abs_2503_17096
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Multi-modal Multi-platform Person Re-Identification: Benchmark and Method
Ha, Ruiyang
Jiang, Songyi
Li, Bin
Pan, Bikang
Zhu, Yihang
Zhang, Junjie
Zhu, Xiatian
Gong, Shaogang
Wang, Jingya
Computer Vision and Pattern Recognition
Conventional person re-identification (ReID) research is often limited to single-modality sensor data from static cameras, which fails to address the complexities of real-world scenarios where multi-modal signals are increasingly prevalent. For instance, consider an urban ReID system integrating stationary RGB cameras, nighttime infrared sensors, and UAVs equipped with dynamic tracking capabilities. Such systems face significant challenges due to variations in camera perspectives, lighting conditions, and sensor modalities, hindering effective person ReID. To address these challenges, we introduce the MP-ReID benchmark, a novel dataset designed specifically for multi-modality and multi-platform ReID. This benchmark uniquely compiles data from 1,930 identities across diverse modalities, including RGB, infrared, and thermal imaging, captured by both UAVs and ground-based cameras in indoor and outdoor environments. Building on this benchmark, we introduce Uni-Prompt ReID, a framework with specific-designed prompts, tailored for cross-modality and cross-platform scenarios. Our method consistently outperforms state-of-the-art approaches, establishing a robust foundation for future research in complex and dynamic ReID environments. Our dataset are available at:https://mp-reid.github.io/.
title Multi-modal Multi-platform Person Re-Identification: Benchmark and Method
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2503.17096