A quality control algorithm for filtering SNPs in genome-wide association studies

Monnat Pongpanich, Patrick F. Sullivan, Jung Ying Tzeng

研究成果: Article同行評審

19 引文 斯高帕斯(Scopus)

摘要

Motivation: The quality control (QC) filtering of single nucleotide polymorphisms (SNPs) is an important step in genome-wide association studies to minimize potential false findings. SNP QC commonly uses expert-guided filters based on QC variables [e.g. Hardy-Weinberg equilibrium, missing proportion (MSP) and minor allele frequency (MAF)] to remove SNPs with insufficient genotyping quality. The rationale of the expert filters is sensible and concrete, but its implementation requires arbitrary thresholds and does not jointly consider all QC features. Results: We propose an algorithm that is based on principal component analysis and clustering analysis to identify low-quality SNPs. The method minimizes the use of arbitrary cutoff values, allows a collective consideration of the QC features and provides conditional thresholds contingent on other QC variables (e.g. different MSP thresholds for different MAFs). We apply our method to the seven studies from the Wellcome Trust Case Control Consortium and the major depressive disorder study from the Genetic Association Information Network. We measured the performance of our method compared to the expert filters based on the following criteria: (i) percentage of SNPs excluded due to low quality; (ii) inflation factor of the test statistics (λ); (iii) number of false associations found in the filtered dataset; and (iv) number of true associations missed in the filtered dataset. The results suggest that with the same or fewer SNPs excluded, the proposed algorithm tends to give a similar or lower value of λ, a reduced number of false associations, and retains all true associations.

原文English
文章編號btq272
頁(從 - 到)1731-1737
頁數7
期刊Bioinformatics
26
發行號14
DOIs
出版狀態Published - 2010 5月 25

All Science Journal Classification (ASJC) codes

  • 統計與概率
  • 生物化學
  • 分子生物學
  • 電腦科學應用
  • 計算機理論與數學
  • 計算數學

指紋

深入研究「A quality control algorithm for filtering SNPs in genome-wide association studies」主題。共同形成了獨特的指紋。

引用此