TY - JOUR
T1 - Automated learning of t factor analysis models with complete and incomplete data
AU - Wang, Wan Lun
AU - Castro, Luis M.
AU - Lin, Tsung I.
N1 - Funding Information:
We gratefully acknowledge the Editor-in-Chief, Christian Genest, the Associate Editor and two anonymous referees for their comments and suggestions that greatly improved this paper. We are also grateful to Ms. Yu-Ju Wang for her skillful assistance on preparing the initial manuscript. W.L. Wang and T.I. Lin would like to acknowledge the support of the Ministry of Science and Technology of Taiwan under Grant Nos. MOST 105-2118-M-035-004-MY2 and MOST 105-2118-M-005-003-MY2 , respectively. L.M. Castro acknowledges support from Grant FONDECYT 1170258 from Chilean government.
Publisher Copyright:
© 2017 Elsevier Inc.
PY - 2017/9
Y1 - 2017/9
N2 - The t factor analysis (tFA) model is a promising tool for robust reduction of high-dimensional data in the presence of heavy-tailed noises. When determining the number of factors of the tFA model, a two-stage procedure is commonly performed in which parameter estimation is carried out for a number of candidate models, and then the best model is chosen according to certain penalized likelihood indices such as the Bayesian information criterion. However, the computational burden of such a procedure could be extremely high to achieve the optimal performance, particularly for extensively large data sets. In this paper, we develop a novel automated learning method in which parameter estimation and model selection are seamlessly integrated into a one-stage algorithm. This new scheme is called the automated tFA (AtFA) algorithm, and it is also workable when values are missing. In addition, we derive the Fisher information matrix to approximate the asymptotic covariance matrix associated with the ML estimators of tFA models. Experiments on real and simulated data sets reveal that the AtFA algorithm not only provides identical fitting results, as compared to traditional two-stage procedures, but also runs much faster, especially when values are missing.
AB - The t factor analysis (tFA) model is a promising tool for robust reduction of high-dimensional data in the presence of heavy-tailed noises. When determining the number of factors of the tFA model, a two-stage procedure is commonly performed in which parameter estimation is carried out for a number of candidate models, and then the best model is chosen according to certain penalized likelihood indices such as the Bayesian information criterion. However, the computational burden of such a procedure could be extremely high to achieve the optimal performance, particularly for extensively large data sets. In this paper, we develop a novel automated learning method in which parameter estimation and model selection are seamlessly integrated into a one-stage algorithm. This new scheme is called the automated tFA (AtFA) algorithm, and it is also workable when values are missing. In addition, we derive the Fisher information matrix to approximate the asymptotic covariance matrix associated with the ML estimators of tFA models. Experiments on real and simulated data sets reveal that the AtFA algorithm not only provides identical fitting results, as compared to traditional two-stage procedures, but also runs much faster, especially when values are missing.
UR - http://www.scopus.com/inward/record.url?scp=85028555696&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85028555696&partnerID=8YFLogxK
U2 - 10.1016/j.jmva.2017.07.009
DO - 10.1016/j.jmva.2017.07.009
M3 - Article
AN - SCOPUS:85028555696
SN - 0047-259X
VL - 161
SP - 157
EP - 171
JO - Journal of Multivariate Analysis
JF - Journal of Multivariate Analysis
ER -