TY - JOUR
T1 - Robust mixture regression modeling based on the normal mean-variance mixture distributions
AU - Naderi, Mehrdad
AU - Mirfarah, Elham
AU - Wang, Wan Lun
AU - Lin, Tsung I.
N1 - Funding Information:
The authors gratefully acknowledge the Co-Editors, the Associate Editor and two anonymous referees for their comments and suggestions that significantly improved the quality of this paper. This research was supported by the National Science and Technology Council of Taiwan under Grant Nos. 110-2811-M-005-510-MY2 , 111-2811-M-006-006-MY2 , 110-2118-M-006-006-MY3 and 109-2118-M-005-005-MY3 .
Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2023/4
Y1 - 2023/4
N2 - Mixture regression models (MRMs) are widely used to capture the heterogeneity of relationships between the response variable and one or more predictors coming from several non-homogeneous groups. Since the conventional MRMs are quite sensitive to departures from normality caused by extra skewness and possible heavy tails, various extensions built on more flexible distributions have been put forward in the last decade. The class of normal mean-variance mixture (NMVM) distributions that arise from scaling both the mean and variance of a normal random variable with a common mixing distribution encompasses many prominent (symmetric or asymmetrical) distributions as special cases. A unified approach to robustifying MRMs is proposed by considering the class of NMVM distributions for component errors. An expectation conditional maximization either (ECME) algorithm, which incorporates membership indicators and the latent scaling variables as the missing data, is developed for carrying out maximum likelihood (ML) estimation of model parameters. Four simulation studies are conducted to examine the finite-sample property of ML estimators and the robustness of the proposed model against outliers for contaminated and noisy data. The usefulness and superiority of our methodology are demonstrated through applications to two real datasets.
AB - Mixture regression models (MRMs) are widely used to capture the heterogeneity of relationships between the response variable and one or more predictors coming from several non-homogeneous groups. Since the conventional MRMs are quite sensitive to departures from normality caused by extra skewness and possible heavy tails, various extensions built on more flexible distributions have been put forward in the last decade. The class of normal mean-variance mixture (NMVM) distributions that arise from scaling both the mean and variance of a normal random variable with a common mixing distribution encompasses many prominent (symmetric or asymmetrical) distributions as special cases. A unified approach to robustifying MRMs is proposed by considering the class of NMVM distributions for component errors. An expectation conditional maximization either (ECME) algorithm, which incorporates membership indicators and the latent scaling variables as the missing data, is developed for carrying out maximum likelihood (ML) estimation of model parameters. Four simulation studies are conducted to examine the finite-sample property of ML estimators and the robustness of the proposed model against outliers for contaminated and noisy data. The usefulness and superiority of our methodology are demonstrated through applications to two real datasets.
UR - http://www.scopus.com/inward/record.url?scp=85147916824&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85147916824&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2022.107661
DO - 10.1016/j.csda.2022.107661
M3 - Article
AN - SCOPUS:85147916824
SN - 0167-9473
VL - 180
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
M1 - 107661
ER -