Saved in:
Bibliographic Details
Main Authors: Ghriss, Ayoub, Monteleoni, Claire
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.03405
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915190527754240
author Ghriss, Ayoub
Monteleoni, Claire
author_facet Ghriss, Ayoub
Monteleoni, Claire
contents We propose a novel approach for optimizing the graph ratio-cut by modeling the binary assignments as random variables. We provide an upper bound on the expected ratio-cut, as well as an unbiased estimate of its gradient, to learn the parameters of the assignment variables in an online setting. The clustering resulting from our probabilistic approach (PRCut) outperforms the Rayleigh quotient relaxation of the combinatorial problem, its online learning extensions, and several widely used methods. We demonstrate that the PRCut clustering closely aligns with the similarity measure and can perform as well as a supervised classifier when label-based similarities are provided. This novel approach can leverage out-of-the-box self-supervised representations to achieve competitive performance and serve as an evaluation method for the quality of these representations.
format Preprint
id arxiv_https___arxiv_org_abs_2502_03405
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Deep Clustering via Probabilistic Ratio-Cut Optimization
Ghriss, Ayoub
Monteleoni, Claire
Machine Learning
Computer Vision and Pattern Recognition
We propose a novel approach for optimizing the graph ratio-cut by modeling the binary assignments as random variables. We provide an upper bound on the expected ratio-cut, as well as an unbiased estimate of its gradient, to learn the parameters of the assignment variables in an online setting. The clustering resulting from our probabilistic approach (PRCut) outperforms the Rayleigh quotient relaxation of the combinatorial problem, its online learning extensions, and several widely used methods. We demonstrate that the PRCut clustering closely aligns with the similarity measure and can perform as well as a supervised classifier when label-based similarities are provided. This novel approach can leverage out-of-the-box self-supervised representations to achieve competitive performance and serve as an evaluation method for the quality of these representations.
title Deep Clustering via Probabilistic Ratio-Cut Optimization
topic Machine Learning
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2502.03405