Model-based clustering of censored data via mixtures of factor analyzers

Wan Lun Wang, Luis M. Castro, Victor H. Lachos, Tsung I. Lin

Research output: Contribution to journal › Article › peer-review

15 Citations (Scopus)

Abstract

Mixtures of factor analyzers (MFA) provide a promising tool for modeling and clustering high-dimensional data in which a very large number of attributes are measured on individuals arising from a heterogeneous population. Owing to the limitations of experimental apparatus, measurements can be restricted to lower and/or upper detection bounds, so the data may be censored. In this paper, we extend the MFA to accommodate censored data; the new model is called the MFA with censoring (MFAC). A computationally feasible alternating expectation conditional maximization (AECM) algorithm is developed to carry out maximum likelihood estimation of the MFAC model. Practical issues related to model-based clustering and recovery of censored data are also discussed. Simulation studies are conducted to examine the effect of censoring on classification, estimation and cluster validation. We also apply the proposed approach to two real data examples that contain left-censored observations.
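To make the data setting in the abstract concrete, the following minimal sketch simulates observations from a small mixture of factor analyzers and then left-censors them at a detection limit. It is an illustration only, not the authors' MFAC model or their AECM algorithm: the function names `simulate_mfa` and `left_censor`, the parameter values, and the detection limit of 0.0 are all hypothetical, and the standard MFA form (component mean plus loadings times standard-normal factors plus independent noise) is assumed.

```python
import random


def simulate_mfa(n, weights, means, loadings, noise_sd, seed=0):
    """Draw n points from a mixture of factor analyzers (illustrative only)."""
    rng = random.Random(seed)
    p = len(means[0])        # number of observed attributes
    q = len(loadings[0][0])  # number of latent factors
    data, labels = [], []
    for _ in range(n):
        # Pick a mixture component according to the mixing proportions.
        i = rng.choices(range(len(weights)), weights=weights)[0]
        # Latent factors are standard normal.
        u = [rng.gauss(0.0, 1.0) for _ in range(q)]
        # y = mean + loadings * factors + independent noise per attribute.
        y = [
            means[i][r]
            + sum(loadings[i][r][k] * u[k] for k in range(q))
            + rng.gauss(0.0, noise_sd[i][r])
            for r in range(p)
        ]
        data.append(y)
        labels.append(i)
    return data, labels


def left_censor(data, limit):
    """Replace values below the detection limit by the limit and flag them."""
    flags = [[v < limit for v in row] for row in data]
    censored = [[limit if v < limit else v for v in row] for row in data]
    return censored, flags


# Hypothetical settings: two components, three attributes, one factor.
weights = [0.6, 0.4]
means = [[0.0, 0.0, 0.0], [3.0, 3.0, 3.0]]
loadings = [[[1.0], [0.5], [0.2]], [[0.3], [0.8], [1.0]]]
noise_sd = [[0.5, 0.5, 0.5], [0.5, 0.5, 0.5]]

data, labels = simulate_mfa(200, weights, means, loadings, noise_sd, seed=1)
censored, flags = left_censor(data, limit=0.0)
rate = sum(map(sum, flags)) / (len(flags) * len(flags[0]))
print(f"proportion of censored entries: {rate:.2f}")
```

In a censored-data analysis, only `censored` and `flags` would be available: a flagged entry is known to lie at or below the limit, but its true value is not observed.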

Original language: English
Pages (from-to): 104-121
Number of pages: 18
Journal: Computational Statistics and Data Analysis
Volume: 140
Publication status: Published - 2019 Dec

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Computational Mathematics
  • Computational Theory and Mathematics
  • Applied Mathematics
