Abstract
This article introduces a robust extension of the mixture of factor analysis models based on the restricted multivariate skew-t distribution, called mixtures of skew-t factor analysis (MSTFA) model. This model can be viewed as a powerful tool for model-based clustering of high-dimensional data where observations in each cluster exhibit non-normal features such as heavy-tailed noises and extreme skewness. Missing values may be frequently present due to the incomplete collection of data. A computationally feasible EM-type algorithm is developed to carry out maximum likelihood estimation and create single imputation of possible missing values under a missing at random mechanism. The numbers of factors and mixture components are determined via penalized likelihood criteria. The utility of our proposed methodology is illustrated through analysing both simulated and real datasets. Numerical results are shown to perform favourably compared to existing approaches.
Original language | English |
---|---|
Pages (from-to) | 50-72 |
Number of pages | 23 |
Journal | Statistical Modelling |
Volume | 18 |
Issue number | 1 |
DOIs | |
Publication status | Published - 2018 Feb 1 |
All Science Journal Classification (ASJC) codes
- Statistics and Probability
- Statistics, Probability and Uncertainty