Saved in:
Bibliographic Details
Main Authors: Avrachenkov, Konstantin, Dreveton, Maximilien
Format: Preprint
Published: 2020
Subjects:
Online Access:https://arxiv.org/abs/2007.14717
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929697263190016
author Avrachenkov, Konstantin
Dreveton, Maximilien
author_facet Avrachenkov, Konstantin
Dreveton, Maximilien
contents Graph-based semi-supervised learning methods combine the graph structure and labeled data to classify unlabeled data. In this work, we study the effect of a noisy oracle on classification. In particular, we derive the Maximum A Posteriori (MAP) estimator for clustering a Degree Corrected Stochastic Block Model (DC-SBM) when a noisy oracle reveals a fraction of the labels. We then propose an algorithm derived from a continuous relaxation of the MAP, and we establish its consistency. Numerical experiments show that our approach achieves promising performance on synthetic and real data sets, even in the case of very noisy labeled data.
format Preprint
id arxiv_https___arxiv_org_abs_2007_14717
institution arXiv
publishDate 2020
record_format arxiv
spellingShingle Almost exact recovery in noisy semi-supervised learning
Avrachenkov, Konstantin
Dreveton, Maximilien
Machine Learning
Statistics Theory
62F12, 62H30, 68T10
Graph-based semi-supervised learning methods combine the graph structure and labeled data to classify unlabeled data. In this work, we study the effect of a noisy oracle on classification. In particular, we derive the Maximum A Posteriori (MAP) estimator for clustering a Degree Corrected Stochastic Block Model (DC-SBM) when a noisy oracle reveals a fraction of the labels. We then propose an algorithm derived from a continuous relaxation of the MAP, and we establish its consistency. Numerical experiments show that our approach achieves promising performance on synthetic and real data sets, even in the case of very noisy labeled data.
title Almost exact recovery in noisy semi-supervised learning
topic Machine Learning
Statistics Theory
62F12, 62H30, 68T10
url https://arxiv.org/abs/2007.14717