TY - JOUR
T1 - Mixtures of common factor analyzers for high-dimensional data with missing information
AU - Wang, Wan Lun
N1 - Funding Information:
The author would like to express her deepest thanks to the Chief Editor, the Associate Editor and three anonymous reviewers for their valuable comments and suggestions that greatly improved this paper. This work was supported by the National Science Council of Taiwan (Grant nos. NSC 100-2118-M-035-002 and NSC 101-2118-M-035-003-MY2 ).
PY - 2013/5
Y1 - 2013/5
N2 - Mixtures of common factor analyzers (MCFA), thought of as a parsimonious extension of mixture factor analyzers (MFA), have recently been developed as a novel approach to analyzing high-dimensional data, where the number of observations n is not very large relative to their dimension p. The key idea behind MCFA is to reduce further the number of parameters in the specification of the component-covariance matrices. An attractive and important feature of MCFA is to allow visualizing data in lower dimensions. The occurrence of missing data persists in many scientific investigations and often complicates data analysis. In this paper, we establish a computationally flexible EM-type algorithm for parameter estimation of the MCFA model with partially observed data. To facilitate the implementation, two auxiliary permutation matrices are incorporated into the estimating procedure for exactly extracting the location of observed and missing components of each observation. Practical issues related to the specification of initial values, model-based clustering and discriminant procedure are also discussed. Our methodology is illustrated through real and simulated examples.
AB - Mixtures of common factor analyzers (MCFA), thought of as a parsimonious extension of mixture factor analyzers (MFA), have recently been developed as a novel approach to analyzing high-dimensional data, where the number of observations n is not very large relative to their dimension p. The key idea behind MCFA is to reduce further the number of parameters in the specification of the component-covariance matrices. An attractive and important feature of MCFA is to allow visualizing data in lower dimensions. The occurrence of missing data persists in many scientific investigations and often complicates data analysis. In this paper, we establish a computationally flexible EM-type algorithm for parameter estimation of the MCFA model with partially observed data. To facilitate the implementation, two auxiliary permutation matrices are incorporated into the estimating procedure for exactly extracting the location of observed and missing components of each observation. Practical issues related to the specification of initial values, model-based clustering and discriminant procedure are also discussed. Our methodology is illustrated through real and simulated examples.
UR - http://www.scopus.com/inward/record.url?scp=84875059316&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84875059316&partnerID=8YFLogxK
U2 - 10.1016/j.jmva.2013.02.003
DO - 10.1016/j.jmva.2013.02.003
M3 - Article
AN - SCOPUS:84875059316
SN - 0047-259X
VL - 117
SP - 120
EP - 133
JO - Journal of Multivariate Analysis
JF - Journal of Multivariate Analysis
ER -