Saved in:
Bibliographic Details
Main Authors: Zhang, Qin, Xu, Linghan, Tang, Qingming, Fang, Jun, Wu, Ying Nian, Tighe, Joe, Xing, Yifan
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2307.04047
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914711193255936
author Zhang, Qin
Xu, Linghan
Tang, Qingming
Fang, Jun
Wu, Ying Nian
Tighe, Joe
Xing, Yifan
author_facet Zhang, Qin
Xu, Linghan
Tang, Qingming
Fang, Jun
Wu, Ying Nian
Tighe, Joe
Xing, Yifan
contents Existing losses used in deep metric learning (DML) for image retrieval often lead to highly non-uniform intra-class and inter-class representation structures across test classes and data distributions. When combined with the common practice of using a fixed threshold to declare a match, this gives rise to significant performance variations in terms of false accept rate (FAR) and false reject rate (FRR) across test classes and data distributions. We define this issue in DML as threshold inconsistency. In real-world applications, such inconsistency often complicates the threshold selection process when deploying commercial image retrieval systems. To measure this inconsistency, we propose a novel variance-based metric called Operating-Point-Inconsistency-Score (OPIS) that quantifies the variance in the operating characteristics across classes. Using the OPIS metric, we find that achieving high accuracy levels in a DML model does not automatically guarantee threshold consistency. In fact, our investigation reveals a Pareto frontier in the high-accuracy regime, where existing methods to improve accuracy often lead to degradation in threshold consistency. To address this trade-off, we introduce the Threshold-Consistent Margin (TCM) loss, a simple yet effective regularization technique that promotes uniformity in representation structures across classes by selectively penalizing hard sample pairs. Extensive experiments demonstrate TCM's effectiveness in enhancing threshold consistency while preserving accuracy, simplifying the threshold selection process in practical DML settings.
format Preprint
id arxiv_https___arxiv_org_abs_2307_04047
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Threshold-Consistent Margin Loss for Open-World Deep Metric Learning
Zhang, Qin
Xu, Linghan
Tang, Qingming
Fang, Jun
Wu, Ying Nian
Tighe, Joe
Xing, Yifan
Computer Vision and Pattern Recognition
Existing losses used in deep metric learning (DML) for image retrieval often lead to highly non-uniform intra-class and inter-class representation structures across test classes and data distributions. When combined with the common practice of using a fixed threshold to declare a match, this gives rise to significant performance variations in terms of false accept rate (FAR) and false reject rate (FRR) across test classes and data distributions. We define this issue in DML as threshold inconsistency. In real-world applications, such inconsistency often complicates the threshold selection process when deploying commercial image retrieval systems. To measure this inconsistency, we propose a novel variance-based metric called Operating-Point-Inconsistency-Score (OPIS) that quantifies the variance in the operating characteristics across classes. Using the OPIS metric, we find that achieving high accuracy levels in a DML model does not automatically guarantee threshold consistency. In fact, our investigation reveals a Pareto frontier in the high-accuracy regime, where existing methods to improve accuracy often lead to degradation in threshold consistency. To address this trade-off, we introduce the Threshold-Consistent Margin (TCM) loss, a simple yet effective regularization technique that promotes uniformity in representation structures across classes by selectively penalizing hard sample pairs. Extensive experiments demonstrate TCM's effectiveness in enhancing threshold consistency while preserving accuracy, simplifying the threshold selection process in practical DML settings.
title Threshold-Consistent Margin Loss for Open-World Deep Metric Learning
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2307.04047