Automated learning of t factor analysis models with complete and incomplete data

Wan Lun Wang, Luis M. Castro, Tsung I. Lin

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

The t factor analysis (tFA) model is a promising tool for robust reduction of high-dimensional data in the presence of heavy-tailed noises. When determining the number of factors of the tFA model, a two-stage procedure is commonly performed in which parameter estimation is carried out for a number of candidate models, and then the best model is chosen according to certain penalized likelihood indices such as the Bayesian information criterion. However, the computational burden of such a procedure could be extremely high to achieve the optimal performance, particularly for extensively large data sets. In this paper, we develop a novel automated learning method in which parameter estimation and model selection are seamlessly integrated into a one-stage algorithm. This new scheme is called the automated tFA (AtFA) algorithm, and it is also workable when values are missing. In addition, we derive the Fisher information matrix to approximate the asymptotic covariance matrix associated with the ML estimators of tFA models. Experiments on real and simulated data sets reveal that the AtFA algorithm not only provides identical fitting results, as compared to traditional two-stage procedures, but also runs much faster, especially when values are missing.

Original languageEnglish
Pages (from-to)157-171
Number of pages15
JournalJournal of Multivariate Analysis
Volume161
DOIs
Publication statusPublished - 2017 Sept

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Numerical Analysis
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Automated learning of t factor analysis models with complete and incomplete data'. Together they form a unique fingerprint.

Cite this