Saved in:
Bibliographic Details
Main Authors: Kasa, Siva Rajesh, Yijie, Hu, Kasa, Santhosh Kumar, Rajan, Vaibhav
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.10229
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911778128003072
author Kasa, Siva Rajesh
Yijie, Hu
Kasa, Santhosh Kumar
Rajan, Vaibhav
author_facet Kasa, Siva Rajesh
Yijie, Hu
Kasa, Santhosh Kumar
Rajan, Vaibhav
contents \texttt{Mixture-Models} is an open-source Python library for fitting Gaussian Mixture Models (GMM) and their variants, such as Parsimonious GMMs, Mixture of Factor Analyzers, MClust models, Mixture of Student's t distributions, etc. It streamlines the implementation and analysis of these models using various first/second order optimization routines such as Gradient Descent and Newton-CG through automatic differentiation (AD) tools. This helps in extending these models to high-dimensional data, which is first of its kind among Python libraries. The library provides user-friendly model evaluation tools, such as BIC, AIC, and log-likelihood estimation. The source-code is licensed under MIT license and can be accessed at \url{https://github.com/kasakh/Mixture-Models}. The package is highly extensible, allowing users to incorporate new distributions and optimization techniques with ease. We conduct a large scale simulation to compare the performance of various gradient based approaches against Expectation Maximization on a wide range of settings and identify the corresponding best suited approach.
format Preprint
id arxiv_https___arxiv_org_abs_2402_10229
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Mixture-Models: a one-stop Python Library for Model-based Clustering using various Mixture Models
Kasa, Siva Rajesh
Yijie, Hu
Kasa, Santhosh Kumar
Rajan, Vaibhav
Computation
Machine Learning
\texttt{Mixture-Models} is an open-source Python library for fitting Gaussian Mixture Models (GMM) and their variants, such as Parsimonious GMMs, Mixture of Factor Analyzers, MClust models, Mixture of Student's t distributions, etc. It streamlines the implementation and analysis of these models using various first/second order optimization routines such as Gradient Descent and Newton-CG through automatic differentiation (AD) tools. This helps in extending these models to high-dimensional data, which is first of its kind among Python libraries. The library provides user-friendly model evaluation tools, such as BIC, AIC, and log-likelihood estimation. The source-code is licensed under MIT license and can be accessed at \url{https://github.com/kasakh/Mixture-Models}. The package is highly extensible, allowing users to incorporate new distributions and optimization techniques with ease. We conduct a large scale simulation to compare the performance of various gradient based approaches against Expectation Maximization on a wide range of settings and identify the corresponding best suited approach.
title Mixture-Models: a one-stop Python Library for Model-based Clustering using various Mixture Models
topic Computation
Machine Learning
url https://arxiv.org/abs/2402.10229