Saved in:
Bibliographic Details
Main Authors: Flynt, Abby, Dean, Nema
Format: Recurso educativo Open Access
Language:en
Published: 2016
Subjects:
Online Access:https://eric.ed.gov/?id=EJ1094377
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • A Survey of Popular R Packages for Cluster Analysis Flynt, Abby Dean, Nema Multivariate Analysis Computer Software Comparative Analysis Programming Languages Models Cluster analysis is a set of statistical methods for discovering new group/class structure when exploring data sets. This article reviews the following popular libraries/commands in the R software language for applying different types of cluster analysis: from the stats library, the kmeans, and hclust functions; the mclust library; the poLCA library; and the clustMD library. The packages/functions cover a variety of cluster analysis methods for continuous data, categorical data, or a collection of the two. The contrasting methods in the different packages are briefly introduced, and basic usage of the functions is discussed. The use of the different methods is compared and contrasted and then illustrated on example data. In the discussion, links to information on other available libraries for different clustering methods and extensions beyond basic clustering methods are given. The code for the worked examples in Section 2 is available at http://www.stats.gla.ac.uk/~nd29c/Software/ClusterReviewCode.R