Robust clustering of multiply censored data via mixtures of t factor analyzers

Wan Lun Wang, Tsung I. Lin

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

Mixtures of t factor analyzers (MtFA) have been well recognized as a prominent tool in modeling and clustering multivariate data contaminated with heterogeneity and outliers. In certain practical situations, however, data are likely to be censored such that the standard methodology becomes computationally complicated or even infeasible. This paper presents an extended framework of MtFA that can accommodate censored data, referred to as MtFAC in short. For maximum likelihood estimation, we construct an alternating expectation conditional maximization algorithm in which the E-step relies on the first-two moments of truncated multivariate-t distributions and CM-steps offer tractable solutions of updated estimators. Asymptotic standard errors of mixing proportions and component mean vectors are derived by means of missing information principle, or the so-called Louis’ method. Several numerical experiments are conducted to examine the finite-sample properties of estimators and the ability of the proposed model to downweight the impact of censoring and outlying effects. Further, the efficacy and usefulness of the proposed method are also demonstrated by analyzing a real dataset with genuine censored observations.

Original languageEnglish
Pages (from-to)22-53
Number of pages32
JournalTest
Volume31
Issue number1
DOIs
Publication statusPublished - 2022 Mar

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Robust clustering of multiply censored data via mixtures of t factor analyzers'. Together they form a unique fingerprint.

Cite this