Saved in:
Bibliographic Details
Main Authors: Katz, Nadav, Jaffe, Ariel
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.08042
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917257846718464
author Katz, Nadav
Jaffe, Ariel
author_facet Katz, Nadav
Jaffe, Ariel
contents Semi-supervised learning (SSL) addresses the critical challenge of training accurate models when labeled data is scarce but unlabeled data is abundant. Graph-based SSL (GSSL) has emerged as a popular framework that captures data structure through graph representations. Classic graph SSL methods, such as Label Propagation and Label Spreading, aim to compute low-dimensional representations where points with the same labels are close in representation space. Although often effective, these methods can be suboptimal on data with complex label distributions. In our work, we develop AUC-spec, a graph approach that computes a low-dimensional representation that maximizes class separation. We compute this representation by optimizing the Area Under the ROC Curve (AUC) as estimated via the labeled points. We provide a detailed analysis of our approach under a product-of-manifold model, and show that the required number of labeled points for AUC-spec is polynomial in the model parameters. Empirically, we show that AUC-spec balances class separation with graph smoothness. It demonstrates competitive results on synthetic and real-world datasets while maintaining computational efficiency comparable to the field's classic and state-of-the-art methods.
format Preprint
id arxiv_https___arxiv_org_abs_2602_08042
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Graph-based Semi-Supervised Learning via Maximum Discrimination
Katz, Nadav
Jaffe, Ariel
Machine Learning
Semi-supervised learning (SSL) addresses the critical challenge of training accurate models when labeled data is scarce but unlabeled data is abundant. Graph-based SSL (GSSL) has emerged as a popular framework that captures data structure through graph representations. Classic graph SSL methods, such as Label Propagation and Label Spreading, aim to compute low-dimensional representations where points with the same labels are close in representation space. Although often effective, these methods can be suboptimal on data with complex label distributions. In our work, we develop AUC-spec, a graph approach that computes a low-dimensional representation that maximizes class separation. We compute this representation by optimizing the Area Under the ROC Curve (AUC) as estimated via the labeled points. We provide a detailed analysis of our approach under a product-of-manifold model, and show that the required number of labeled points for AUC-spec is polynomial in the model parameters. Empirically, we show that AUC-spec balances class separation with graph smoothness. It demonstrates competitive results on synthetic and real-world datasets while maintaining computational efficiency comparable to the field's classic and state-of-the-art methods.
title Graph-based Semi-Supervised Learning via Maximum Discrimination
topic Machine Learning
url https://arxiv.org/abs/2602.08042