Saved in:
Bibliographic Details
Main Authors: Shin, Hwasoo, Ferreira, Marco A. R., Tegge, Allison N.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.05280
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910932944289792
author Shin, Hwasoo
Ferreira, Marco A. R.
Tegge, Allison N.
author_facet Shin, Hwasoo
Ferreira, Marco A. R.
Tegge, Allison N.
contents We present a novel framework for concomitant dimension reduction and clustering. This framework is based on a novel class of Bayesian clustering factor models. These models assume a factor model structure where the vectors of common factors follow a mixture of Gaussian distributions. We develop a Gibbs sampler to explore the posterior distribution and propose an information criterion to select the number of clusters and the number of factors. Simulation studies show that our inferential approach appropriately quantifies uncertainty. In addition, when compared to a previously published competitor method, our information criterion has favorable performance in terms of correct selection of number of clusters and number of factors. Finally, we illustrate the capabilities of our framework with an application to data on recovery from opioid use disorder where clustering of individuals may facilitate personalized health care.
format Preprint
id arxiv_https___arxiv_org_abs_2505_05280
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Bayesian Clustering Factor Models
Shin, Hwasoo
Ferreira, Marco A. R.
Tegge, Allison N.
Methodology
We present a novel framework for concomitant dimension reduction and clustering. This framework is based on a novel class of Bayesian clustering factor models. These models assume a factor model structure where the vectors of common factors follow a mixture of Gaussian distributions. We develop a Gibbs sampler to explore the posterior distribution and propose an information criterion to select the number of clusters and the number of factors. Simulation studies show that our inferential approach appropriately quantifies uncertainty. In addition, when compared to a previously published competitor method, our information criterion has favorable performance in terms of correct selection of number of clusters and number of factors. Finally, we illustrate the capabilities of our framework with an application to data on recovery from opioid use disorder where clustering of individuals may facilitate personalized health care.
title Bayesian Clustering Factor Models
topic Methodology
url https://arxiv.org/abs/2505.05280