Saved in:
Bibliographic Details
Main Authors: Zhang, Qingxin, Wei, Haoyan, Qian, Yang
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.06460
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912226380611584
author Zhang, Qingxin
Wei, Haoyan
Qian, Yang
author_facet Zhang, Qingxin
Wei, Haoyan
Qian, Yang
contents Group Re-Identification (Group ReID) aims matching groups of pedestrians across non-overlapping cameras. Unlike single-person ReID, Group ReID focuses more on the changes in group structure, emphasizing the number of members and their spatial arrangement. However, most methods rely on certainty-based models, which consider only the specific group structures in the group images, often failing to match unseen group configurations. To this end, we propose a novel Group-CLIP UncertaintyModeling (GCUM) approach that adapts group text descriptions to undetermined accommodate member and layout variations. Specifically, we design a Member Variant Simulation (MVS)module that simulates member exclusions using a Bernoulli distribution and a Group Layout Adaptation (GLA) module that generates uncertain group text descriptions with identity-specific tokens. In addition, we design a Group RelationshipConstruction Encoder (GRCE) that uses group features to refine individual features, and employ cross-modal contrastive loss to obtain generalizable knowledge from group text descriptions. It is worth noting that we are the first to employ CLIP to GroupReID, and extensive experiments show that GCUM significantly outperforms state-of-the-art Group ReID methods.
format Preprint
id arxiv_https___arxiv_org_abs_2502_06460
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Group-CLIP Uncertainty Modeling for Group Re-Identification
Zhang, Qingxin
Wei, Haoyan
Qian, Yang
Computer Vision and Pattern Recognition
Group Re-Identification (Group ReID) aims matching groups of pedestrians across non-overlapping cameras. Unlike single-person ReID, Group ReID focuses more on the changes in group structure, emphasizing the number of members and their spatial arrangement. However, most methods rely on certainty-based models, which consider only the specific group structures in the group images, often failing to match unseen group configurations. To this end, we propose a novel Group-CLIP UncertaintyModeling (GCUM) approach that adapts group text descriptions to undetermined accommodate member and layout variations. Specifically, we design a Member Variant Simulation (MVS)module that simulates member exclusions using a Bernoulli distribution and a Group Layout Adaptation (GLA) module that generates uncertain group text descriptions with identity-specific tokens. In addition, we design a Group RelationshipConstruction Encoder (GRCE) that uses group features to refine individual features, and employ cross-modal contrastive loss to obtain generalizable knowledge from group text descriptions. It is worth noting that we are the first to employ CLIP to GroupReID, and extensive experiments show that GCUM significantly outperforms state-of-the-art Group ReID methods.
title Group-CLIP Uncertainty Modeling for Group Re-Identification
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2502.06460