Saved in:
Bibliographic Details
Main Authors: Khoufache, Reda, Belhadj, Anisse, Azzag, Hanene, Lebbah, Mustapha
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.01050
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911769588400128
author Khoufache, Reda
Belhadj, Anisse
Azzag, Hanene
Lebbah, Mustapha
author_facet Khoufache, Reda
Belhadj, Anisse
Azzag, Hanene
Lebbah, Mustapha
contents In this paper, we introduce a novel Distributed Markov Chain Monte Carlo (MCMC) inference method for the Bayesian Non-Parametric Latent Block Model (DisNPLBM), employing the Master/Worker architecture. Our non-parametric co-clustering algorithm divides observations and features into partitions using latent multivariate Gaussian block distributions. The workload on rows is evenly distributed among workers, who exclusively communicate with the master and not among themselves. DisNPLBM demonstrates its impact on cluster labeling accuracy and execution times through experimental results. Moreover, we present a real-use case applying our approach to co-cluster gene expression data. The code source is publicly available at https://github.com/redakhoufache/Distributed-NPLBM.
format Preprint
id arxiv_https___arxiv_org_abs_2402_01050
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Distributed MCMC inference for Bayesian Non-Parametric Latent Block Model
Khoufache, Reda
Belhadj, Anisse
Azzag, Hanene
Lebbah, Mustapha
Machine Learning
Computation
In this paper, we introduce a novel Distributed Markov Chain Monte Carlo (MCMC) inference method for the Bayesian Non-Parametric Latent Block Model (DisNPLBM), employing the Master/Worker architecture. Our non-parametric co-clustering algorithm divides observations and features into partitions using latent multivariate Gaussian block distributions. The workload on rows is evenly distributed among workers, who exclusively communicate with the master and not among themselves. DisNPLBM demonstrates its impact on cluster labeling accuracy and execution times through experimental results. Moreover, we present a real-use case applying our approach to co-cluster gene expression data. The code source is publicly available at https://github.com/redakhoufache/Distributed-NPLBM.
title Distributed MCMC inference for Bayesian Non-Parametric Latent Block Model
topic Machine Learning
Computation
url https://arxiv.org/abs/2402.01050