Flexible clustering via extended mixtures of common t-factor analyzers

Wan Lun Wang, Tsung I. Lin

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

Mixtures of t-factor analyzers have been broadly used for model-based density estimation and clustering of high-dimensional data from a heterogeneous population with longer-than-normal tails or atypical observations. To reduce the number of parameters in the component covariance matrices, the mixtures of common t-factor analyzers (MCtFA) have been recently proposed by assuming a common factor loading across different components. In this paper, we present an extended version of MCtFA using distinct covariance matrices for component errors. The modified mixture model offers a more appropriate way to represent the data in a graphical fashion. Two flexible EM-type algorithms are developed for iteratively computing maximum likelihood estimates of parameters. Practical considerations for the specification of starting values, model-based clustering, classification of new subject and identification of potential outliers are also provided. We demonstrate the superiority of the proposed methodology by analyzing the Italian wine data and a simulation study.

Original languageEnglish
Pages (from-to)227-252
Number of pages26
JournalAStA Advances in Statistical Analysis
Volume101
Issue number3
DOIs
Publication statusPublished - 2017 Jul 1

All Science Journal Classification (ASJC) codes

  • Analysis
  • Statistics and Probability
  • Modelling and Simulation
  • Social Sciences (miscellaneous)
  • Economics and Econometrics
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Flexible clustering via extended mixtures of common t-factor analyzers'. Together they form a unique fingerprint.

Cite this