Saved in:
Bibliographic Details
Main Authors: Yue, Yun, Lin, Fangzhou, Mou, Guanyi, Zhang, Ziming
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.15523
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910433091256320
author Yue, Yun
Lin, Fangzhou
Mou, Guanyi
Zhang, Ziming
author_facet Yue, Yun
Lin, Fangzhou
Mou, Guanyi
Zhang, Ziming
contents In recent years, there has been a growing trend of incorporating hyperbolic geometry methods into computer vision. While these methods have achieved state-of-the-art performance on various metric learning tasks using hyperbolic distance measurements, the underlying theoretical analysis supporting this superior performance remains under-exploited. In this study, we investigate the effects of integrating hyperbolic space into metric learning, particularly when training with contrastive loss. We identify a need for a comprehensive comparison between Euclidean and hyperbolic spaces regarding the temperature effect in the contrastive loss within the existing literature. To address this gap, we conduct an extensive investigation to benchmark the results of Vision Transformers (ViTs) using a hybrid objective function that combines loss from Euclidean and hyperbolic spaces. Additionally, we provide a theoretical analysis of the observed performance improvement. We also reveal that hyperbolic metric learning is highly related to hard negative sampling, providing insights for future work. This work will provide valuable data points and experience in understanding hyperbolic image embeddings. To shed more light on problem-solving and encourage further investigation into our approach, our code is available online (https://github.com/YunYunY/HypMix).
format Preprint
id arxiv_https___arxiv_org_abs_2404_15523
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Understanding Hyperbolic Metric Learning through Hard Negative Sampling
Yue, Yun
Lin, Fangzhou
Mou, Guanyi
Zhang, Ziming
Computer Vision and Pattern Recognition
In recent years, there has been a growing trend of incorporating hyperbolic geometry methods into computer vision. While these methods have achieved state-of-the-art performance on various metric learning tasks using hyperbolic distance measurements, the underlying theoretical analysis supporting this superior performance remains under-exploited. In this study, we investigate the effects of integrating hyperbolic space into metric learning, particularly when training with contrastive loss. We identify a need for a comprehensive comparison between Euclidean and hyperbolic spaces regarding the temperature effect in the contrastive loss within the existing literature. To address this gap, we conduct an extensive investigation to benchmark the results of Vision Transformers (ViTs) using a hybrid objective function that combines loss from Euclidean and hyperbolic spaces. Additionally, we provide a theoretical analysis of the observed performance improvement. We also reveal that hyperbolic metric learning is highly related to hard negative sampling, providing insights for future work. This work will provide valuable data points and experience in understanding hyperbolic image embeddings. To shed more light on problem-solving and encourage further investigation into our approach, our code is available online (https://github.com/YunYunY/HypMix).
title Understanding Hyperbolic Metric Learning through Hard Negative Sampling
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2404.15523