Bibliographic Details
Main Authors: Courrier, Violaine; Biernacki, Christophe
Format: Preprint
Published: 2026
Online Access: https://arxiv.org/abs/2604.05513
Abstract:
  • Clustering is viewed as an unsupervised technique, but in practice it requires guidance to uncover meaningful structures. We formalize this with guided clustering, a paradigm that uses a guiding variable to steer the discovery process, and introduce the Guided Clustering Variational Autoencoder (GCVAE) as its deep generative realization. GCVAE learns a latent space structured as a Gaussian Mixture Model by optimizing a variational objective that forces the representation to be maximally informative about the guiding variable. This framework allows the resulting clustering to be reoriented by changing the guiding variable, yielding clusters that are meaningful for the specified context. Experiments on public (MNIST-SVHN) and proprietary connected health devices data demonstrate GCVAE's ability to discover coherent and task-relevant clusters in complex settings.
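
The abstract above describes a latent space structured as a Gaussian Mixture Model. As a rough illustration of what that structure implies, the sketch below shows how a latent code could be turned into soft and hard cluster assignments under a mixture with diagonal covariances. It is not the GCVAE itself: the encoder, the guiding variable, and the variational objective are omitted, and the function names, the two-component mixture, and the latent point `z` are all hypothetical values chosen for illustration.

```python
import math


def log_gaussian_diag(z, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at z."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
        for x, m, v in zip(z, mean, var)
    )


def responsibilities(z, weights, means, variances):
    """Posterior cluster probabilities p(c | z) under a Gaussian mixture."""
    log_joint = [
        math.log(w) + log_gaussian_diag(z, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    # Normalise with log-sum-exp for numerical stability.
    top = max(log_joint)
    log_norm = top + math.log(sum(math.exp(l - top) for l in log_joint))
    return [math.exp(l - log_norm) for l in log_joint]


def assign_cluster(z, weights, means, variances):
    """Hard assignment: index of the most probable mixture component."""
    probs = responsibilities(z, weights, means, variances)
    return max(range(len(probs)), key=probs.__getitem__)


# Hypothetical two-component mixture in a 2-D latent space (assumed values).
weights = [0.5, 0.5]
means = [[-2.0, 0.0], [2.0, 0.0]]
variances = [[1.0, 1.0], [1.0, 1.0]]

z = [1.5, 0.3]  # stand-in for a latent code produced by an encoder
probs = responsibilities(z, weights, means, variances)
cluster = assign_cluster(z, weights, means, variances)
```

In a trained model the mixture parameters would be learned jointly with the encoder; here they are fixed by hand only to make the assignment step concrete.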