Saved in:
Bibliographic Details
Main Authors: Ren, Kaijie, Zhang, Lei
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.11708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910383870050304
author Ren, Kaijie
Zhang, Lei
author_facet Ren, Kaijie
Zhang, Lei
contents Visible-Infrared Person Re-identification (VI-ReID) is a challenging cross-modal pedestrian retrieval task, due to significant intra-class variations and cross-modal discrepancies among different cameras. Existing works mainly focus on embedding images of different modalities into a unified space to mine modality-shared features. They only seek distinctive information within these shared features, while ignoring the identity-aware useful information that is implicit in the modality-specific features. To address this issue, we propose a novel Implicit Discriminative Knowledge Learning (IDKL) network to uncover and leverage the implicit discriminative information contained within the modality-specific. First, we extract modality-specific and modality-shared features using a novel dual-stream network. Then, the modality-specific features undergo purification to reduce their modality style discrepancies while preserving identity-aware discriminative knowledge. Subsequently, this kind of implicit knowledge is distilled into the modality-shared feature to enhance its distinctiveness. Finally, an alignment loss is proposed to minimize modality discrepancy on enhanced modality-shared features. Extensive experiments on multiple public datasets demonstrate the superiority of IDKL network over the state-of-the-art methods. Code is available at https://github.com/1KK077/IDKL.
format Preprint
id arxiv_https___arxiv_org_abs_2403_11708
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification
Ren, Kaijie
Zhang, Lei
Computer Vision and Pattern Recognition
Visible-Infrared Person Re-identification (VI-ReID) is a challenging cross-modal pedestrian retrieval task, due to significant intra-class variations and cross-modal discrepancies among different cameras. Existing works mainly focus on embedding images of different modalities into a unified space to mine modality-shared features. They only seek distinctive information within these shared features, while ignoring the identity-aware useful information that is implicit in the modality-specific features. To address this issue, we propose a novel Implicit Discriminative Knowledge Learning (IDKL) network to uncover and leverage the implicit discriminative information contained within the modality-specific. First, we extract modality-specific and modality-shared features using a novel dual-stream network. Then, the modality-specific features undergo purification to reduce their modality style discrepancies while preserving identity-aware discriminative knowledge. Subsequently, this kind of implicit knowledge is distilled into the modality-shared feature to enhance its distinctiveness. Finally, an alignment loss is proposed to minimize modality discrepancy on enhanced modality-shared features. Extensive experiments on multiple public datasets demonstrate the superiority of IDKL network over the state-of-the-art methods. Code is available at https://github.com/1KK077/IDKL.
title Implicit Discriminative Knowledge Learning for Visible-Infrared Person Re-Identification
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2403.11708