Model-based clustering via mixtures of unrestricted skew normal factor analyzers with complete and incomplete data

Wan Lun Wang, Tsung I. Lin

Research output: Article, peer-reviewed

Abstract

Mixtures of factor analyzers (MFA) based on the restricted skew normal distribution (rMSN) have emerged as a flexible tool to handle asymmetrical high-dimensional data with heterogeneity. However, the rMSN distribution is often criticized for lacking sufficient ability to accommodate potential skewness arising from more than one feature space. This paper presents an alternative extension of MFA by assuming the unrestricted skew normal (uMSN) distribution for the component factors. In particular, the proposed mixtures of unrestricted skew normal factor analyzers (MuSNFA) can simultaneously capture multiple directions of skewness and deal with the occurrence of missing values or nonresponses. Under the missing at random (MAR) mechanism, we develop a computationally feasible expectation conditional maximization (ECM) algorithm for computing the maximum likelihood estimates of model parameters. Practical aspects related to model-based clustering, prediction of factor scores and imputation of missing values are also discussed. The utility of the proposed methodology is illustrated with the analysis of simulated and real datasets.

Original language: English
Pages (from - to): 787-817
Number of pages: 31
Journal: Statistical Methods and Applications
Volume: 32
Issue number: 3
Publication status: Published - Sep 2023

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty
