Model-based clustering via mixtures of unrestricted skew normal factor analyzers with complete and incomplete data

Wan Lun Wang, Tsung I. Lin

Research output: Contribution to journalArticlepeer-review

Abstract

Mixtures of factor analyzers (MFA) based on the restricted skew normal distribution (rMSN) have emerged as a flexible tool to handle asymmetrical high-dimensional data with heterogeneity. However, the rMSN distribution is oft-criticized a lack of sufficient ability to accommodate potential skewness arisen from more than one feature space. This paper presents an alternative extension of MFA by assuming the unrestricted skew normal (uMSN) distribution for the component factors. In particular, the proposed mixtures of unrestricted skew normal factor analyzers (MuSNFA) can simultaneously capture multiple directions of skewness and deal with the occurrence of missing values or nonresponses. Under the missing at random (MAR) mechanism, we develop a computationally feasible expectation conditional maximization (ECM) algorithm for computing the maximum likelihood estimates of model parameters. Practical aspects related to model-based clustering, prediction of factor scores and imputation of missing values are also discussed. The utility of the proposed methodology is illustrated with the analysis of simulated and real datasets.

Original languageEnglish
Pages (from-to)787-817
Number of pages31
JournalStatistical Methods and Applications
Volume32
Issue number3
DOIs
Publication statusPublished - 2023 Sept

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Model-based clustering via mixtures of unrestricted skew normal factor analyzers with complete and incomplete data'. Together they form a unique fingerprint.

Cite this