TY - JOUR
T1 - Across-Platform Imputation of DNA Methylation Levels Incorporating Nonlocal Information Using Penalized Functional Regression
AU - Zhang, Guosheng
AU - Huang, Kuan Chieh
AU - Xu, Zheng
AU - Tzeng, Jung Ying
AU - Conneely, Karen N.
AU - Guan, Weihua
AU - Kang, Jian
AU - Li, Yun
N1 - Publisher Copyright:
© 2016 Wiley Periodicals, Inc.
PY - 2016/5/1
Y1 - 2016/5/1
N2 - DNA methylation is a key epigenetic mark involved in both normal development and disease progression. Recent advances in high-throughput technologies have enabled genome-wide profiling of DNA methylation. However, DNA methylation profiling often employs different designs and platforms with varying resolution, which hinders joint analysis of methylation data from multiple platforms. In this study, we propose a penalized functional regression model to impute missing methylation data. By incorporating functional predictors, our model utilizes information from nonlocal probes to improve imputation quality. Here, we compared the performance of our functional model to linear regression and the best single probe surrogate in real data and via simulations. Specifically, we applied different imputation approaches to an acute myeloid leukemia dataset consisting of 194 samples and our method showed higher imputation accuracy, manifested, for example, by a 94% relative increase in information content and up to 86% more CpG sites passing post-imputation filtering. Our simulated association study further demonstrated that our method substantially improves the statistical power to identify trait-associated methylation loci. These findings indicate that the penalized functional regression model is a convenient and valuable imputation tool for methylation data, and it can boost statistical power in downstream epigenome-wide association study (EWAS).
AB - DNA methylation is a key epigenetic mark involved in both normal development and disease progression. Recent advances in high-throughput technologies have enabled genome-wide profiling of DNA methylation. However, DNA methylation profiling often employs different designs and platforms with varying resolution, which hinders joint analysis of methylation data from multiple platforms. In this study, we propose a penalized functional regression model to impute missing methylation data. By incorporating functional predictors, our model utilizes information from nonlocal probes to improve imputation quality. Here, we compared the performance of our functional model to linear regression and the best single probe surrogate in real data and via simulations. Specifically, we applied different imputation approaches to an acute myeloid leukemia dataset consisting of 194 samples and our method showed higher imputation accuracy, manifested, for example, by a 94% relative increase in information content and up to 86% more CpG sites passing post-imputation filtering. Our simulated association study further demonstrated that our method substantially improves the statistical power to identify trait-associated methylation loci. These findings indicate that the penalized functional regression model is a convenient and valuable imputation tool for methylation data, and it can boost statistical power in downstream epigenome-wide association study (EWAS).
UR - http://www.scopus.com/inward/record.url?scp=84963669759&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84963669759&partnerID=8YFLogxK
U2 - 10.1002/gepi.21969
DO - 10.1002/gepi.21969
M3 - Article
C2 - 27061717
AN - SCOPUS:84963669759
SN - 0741-0395
VL - 40
SP - 333
EP - 340
JO - Genetic Epidemiology
JF - Genetic Epidemiology
IS - 4
ER -