Saved in:
Bibliographic Details
Main Authors: Wang, Lu, Du, Chao, Zhao, Pu, Luo, Chuan, Zhu, Zhangchi, Qiao, Bo, Zhang, Wei, Lin, Qingwei, Rajmohan, Saravan, Zhang, Dongmei, Zhang, Qi
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.08690
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913198605598720
author Wang, Lu
Du, Chao
Zhao, Pu
Luo, Chuan
Zhu, Zhangchi
Qiao, Bo
Zhang, Wei
Lin, Qingwei
Rajmohan, Saravan
Zhang, Dongmei
Zhang, Qi
author_facet Wang, Lu
Du, Chao
Zhao, Pu
Luo, Chuan
Zhu, Zhangchi
Qiao, Bo
Zhang, Wei
Lin, Qingwei
Rajmohan, Saravan
Zhang, Dongmei
Zhang, Qi
contents As one of the most effective self-supervised representation learning methods, contrastive learning (CL) relies on multiple negative pairs to contrast against each positive pair. In the standard practice of contrastive learning, data augmentation methods are utilized to generate both positive and negative pairs. While existing works have been focusing on improving the positive sampling, the negative sampling process is often overlooked. In fact, the generated negative samples are often polluted by positive samples, which leads to a biased loss and performance degradation. To correct the negative sampling bias, we propose a novel contrastive learning method named Positive-Unlabeled Contrastive Learning (PUCL). PUCL treats the generated negative samples as unlabeled samples and uses information from positive samples to correct bias in contrastive loss. We prove that the corrected loss used in PUCL only incurs a negligible bias compared to the unbiased contrastive loss. PUCL can be applied to general contrastive learning problems and outperforms state-of-the-art methods on various image and graph classification tasks. The code of PUCL is in the supplementary file.
format Preprint
id arxiv_https___arxiv_org_abs_2401_08690
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Contrastive Learning with Negative Sampling Correction
Wang, Lu
Du, Chao
Zhao, Pu
Luo, Chuan
Zhu, Zhangchi
Qiao, Bo
Zhang, Wei
Lin, Qingwei
Rajmohan, Saravan
Zhang, Dongmei
Zhang, Qi
Machine Learning
As one of the most effective self-supervised representation learning methods, contrastive learning (CL) relies on multiple negative pairs to contrast against each positive pair. In the standard practice of contrastive learning, data augmentation methods are utilized to generate both positive and negative pairs. While existing works have been focusing on improving the positive sampling, the negative sampling process is often overlooked. In fact, the generated negative samples are often polluted by positive samples, which leads to a biased loss and performance degradation. To correct the negative sampling bias, we propose a novel contrastive learning method named Positive-Unlabeled Contrastive Learning (PUCL). PUCL treats the generated negative samples as unlabeled samples and uses information from positive samples to correct bias in contrastive loss. We prove that the corrected loss used in PUCL only incurs a negligible bias compared to the unbiased contrastive loss. PUCL can be applied to general contrastive learning problems and outperforms state-of-the-art methods on various image and graph classification tasks. The code of PUCL is in the supplementary file.
title Contrastive Learning with Negative Sampling Correction
topic Machine Learning
url https://arxiv.org/abs/2401.08690