Saved in:
Bibliographic Details
Main Authors: Hansen, Colin B., Nath, Vishwesh, Mesa, Diego A., Huo, Yuankai, Landman, Bennett A., Lasko, Thomas A.
Format: Preprint
Published: 2020
Subjects:
Online Access:https://arxiv.org/abs/2003.07921
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929370065534976
author Hansen, Colin B.
Nath, Vishwesh
Mesa, Diego A.
Huo, Yuankai
Landman, Bennett A.
Lasko, Thomas A.
author_facet Hansen, Colin B.
Nath, Vishwesh
Mesa, Diego A.
Huo, Yuankai
Landman, Bennett A.
Lasko, Thomas A.
contents In semi-supervised learning, information from unlabeled examples is used to improve the model learned from labeled examples. In some learning problems, partial label information can be inferred from otherwise unlabeled examples and used to further improve the model. In particular, partial label information exists when subsets of training examples are known to have the same label, even though the label itself is missing. By encouraging the model to give the same label to all such examples through contrastive learning objectives, we can potentially improve its performance. We call this encouragement Nullspace Tuning because the difference vector between any pair of examples with the same label should lie in the nullspace of a linear model. In this paper, we investigate the benefit of using partial label information using a careful comparison framework over well-characterized public datasets. We show that the additional information provided by partial labels reduces test error over good semi-supervised methods usually by a factor of 2, up to a factor of 5.5 in the best case. We also show that adding Nullspace Tuning to the newer and state-of-the-art MixMatch method decreases its test error by up to a factor of 1.8.
format Preprint
id arxiv_https___arxiv_org_abs_2003_07921
institution arXiv
publishDate 2020
record_format arxiv
spellingShingle Semi-supervised Contrastive Learning Using Partial Label Information
Hansen, Colin B.
Nath, Vishwesh
Mesa, Diego A.
Huo, Yuankai
Landman, Bennett A.
Lasko, Thomas A.
Machine Learning
In semi-supervised learning, information from unlabeled examples is used to improve the model learned from labeled examples. In some learning problems, partial label information can be inferred from otherwise unlabeled examples and used to further improve the model. In particular, partial label information exists when subsets of training examples are known to have the same label, even though the label itself is missing. By encouraging the model to give the same label to all such examples through contrastive learning objectives, we can potentially improve its performance. We call this encouragement Nullspace Tuning because the difference vector between any pair of examples with the same label should lie in the nullspace of a linear model. In this paper, we investigate the benefit of using partial label information using a careful comparison framework over well-characterized public datasets. We show that the additional information provided by partial labels reduces test error over good semi-supervised methods usually by a factor of 2, up to a factor of 5.5 in the best case. We also show that adding Nullspace Tuning to the newer and state-of-the-art MixMatch method decreases its test error by up to a factor of 1.8.
title Semi-supervised Contrastive Learning Using Partial Label Information
topic Machine Learning
url https://arxiv.org/abs/2003.07921