Saved in:
Bibliographic Details
Main Authors: Quetti, Federico Maria, Figini, Silvia, ballante, Elena
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.08954
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909315085893632
author Quetti, Federico Maria
Figini, Silvia
ballante, Elena
author_facet Quetti, Federico Maria
Figini, Silvia
ballante, Elena
contents The paper presents a novel approach for unsupervised techniques in the field of clustering. A new method is proposed to enhance existing literature models using the proper Bayesian bootstrap to improve results in terms of robustness and interpretability. Our approach is organized in two steps: k-means clustering is used for prior elicitation, then proper Bayesian bootstrap is applied as resampling method in an ensemble clustering approach. Results are analyzed introducing measures of uncertainty based on Shannon entropy. The proposal provides clear indication on the optimal number of clusters, as well as a better representation of the clustered data. Empirical results are provided on simulated data showing the methodological and empirical advances obtained.
format Preprint
id arxiv_https___arxiv_org_abs_2409_08954
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle A Bayesian Approach to Clustering via the Proper Bayesian Bootstrap: the Bayesian Bagged Clustering (BBC) algorithm
Quetti, Federico Maria
Figini, Silvia
ballante, Elena
Machine Learning
The paper presents a novel approach for unsupervised techniques in the field of clustering. A new method is proposed to enhance existing literature models using the proper Bayesian bootstrap to improve results in terms of robustness and interpretability. Our approach is organized in two steps: k-means clustering is used for prior elicitation, then proper Bayesian bootstrap is applied as resampling method in an ensemble clustering approach. Results are analyzed introducing measures of uncertainty based on Shannon entropy. The proposal provides clear indication on the optimal number of clusters, as well as a better representation of the clustered data. Empirical results are provided on simulated data showing the methodological and empirical advances obtained.
title A Bayesian Approach to Clustering via the Proper Bayesian Bootstrap: the Bayesian Bagged Clustering (BBC) algorithm
topic Machine Learning
url https://arxiv.org/abs/2409.08954