TY - JOUR
T1 - Monitoring photochemical pollutants based on symbolic interval-valued data analysis
AU - Lin, Liang Ching
AU - Guo, Meihui
AU - Lee, Sangyeol
N1 - Publisher Copyright:
© 2022, Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2023/12
Y1 - 2023/12
N2 - This study considers monitoring photochemical pollutants for anomaly detection based on symbolic interval-valued data analysis. For this task, we construct control charts based on the principal component scores of symbolic interval-valued data. Herein, the symbolic interval-valued data are assumed to follow a normal distribution, and an approximate expectation formula of order statistics from the normal distribution is used in the univariate case to estimate the mean and variance via the method of moments. In addition, we consider the bivariate case wherein we use the maximum likelihood estimator calculated from the likelihood function derived under a bivariate copula. We also establish the procedures for the statistical control chart based on the univariate and bivariate interval-valued variables, and the procedures are potentially extendable to higher dimensional cases. Monte Carlo simulations and real data analysis using photochemical pollutants confirm the validity of the proposed method. The results particularly show the superiority over the conventional method that uses the averages to identify the date on which the abnormal maximum occurred.
AB - This study considers monitoring photochemical pollutants for anomaly detection based on symbolic interval-valued data analysis. For this task, we construct control charts based on the principal component scores of symbolic interval-valued data. Herein, the symbolic interval-valued data are assumed to follow a normal distribution, and an approximate expectation formula of order statistics from the normal distribution is used in the univariate case to estimate the mean and variance via the method of moments. In addition, we consider the bivariate case wherein we use the maximum likelihood estimator calculated from the likelihood function derived under a bivariate copula. We also establish the procedures for the statistical control chart based on the univariate and bivariate interval-valued variables, and the procedures are potentially extendable to higher dimensional cases. Monte Carlo simulations and real data analysis using photochemical pollutants confirm the validity of the proposed method. The results particularly show the superiority over the conventional method that uses the averages to identify the date on which the abnormal maximum occurred.
UR - http://www.scopus.com/inward/record.url?scp=85141737711&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85141737711&partnerID=8YFLogxK
U2 - 10.1007/s11634-022-00527-1
DO - 10.1007/s11634-022-00527-1
M3 - Article
AN - SCOPUS:85141737711
SN - 1862-5347
VL - 17
SP - 897
EP - 926
JO - Advances in Data Analysis and Classification
JF - Advances in Data Analysis and Classification
IS - 4
ER -