TY - JOUR
T1 - An evaluation of allele frequency estimation accuracy using pooled sequencing data
AU - Guo, Yan
AU - Cai, Qiuyin
AU - Li, Chun
AU - Li, Jiang
AU - Li, Chung I.
AU - Courtney, Regina
AU - Zheng, Wei
AU - Long, Jirong
PY - 2013
Y1 - 2013
N2 - Next generation sequencing (NGS) technology has matured and, with its current affordability, will replace the SNP chip as the genotyping tool of choice. Even with the current affordability of NGS, large-scale studies will require careful study design to reduce cost. In this study, we designed an experiment to assess the accuracy of allele frequencies estimated from pooled sequencing data. We compared the allele frequencies estimated from sequencing data with those estimated from individual SNP chip data and observed high correlations between them. However, by calculating the error rate, we found that for many SNPs the allele frequency estimated from sequencing data differed significantly from that estimated from SNP chip data. In conclusion, we found that correlation is not an ideal measure for comparing allele frequencies. For the purpose of estimating allele frequency, we do not recommend pooling with NGS as a cheaper alternative to genotyping each sample individually.
AB - Next generation sequencing (NGS) technology has matured and, with its current affordability, will replace the SNP chip as the genotyping tool of choice. Even with the current affordability of NGS, large-scale studies will require careful study design to reduce cost. In this study, we designed an experiment to assess the accuracy of allele frequencies estimated from pooled sequencing data. We compared the allele frequencies estimated from sequencing data with those estimated from individual SNP chip data and observed high correlations between them. However, by calculating the error rate, we found that for many SNPs the allele frequency estimated from sequencing data differed significantly from that estimated from SNP chip data. In conclusion, we found that correlation is not an ideal measure for comparing allele frequencies. For the purpose of estimating allele frequency, we do not recommend pooling with NGS as a cheaper alternative to genotyping each sample individually.
UR - http://www.scopus.com/inward/record.url?scp=84885067990&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84885067990&partnerID=8YFLogxK
U2 - 10.1504/IJCBDD.2013.056709
DO - 10.1504/IJCBDD.2013.056709
M3 - Article
C2 - 24088264
AN - SCOPUS:84885067990
SN - 1756-0756
VL - 6
SP - 279
EP - 293
JO - International Journal of Computational Biology and Drug Design
JF - International Journal of Computational Biology and Drug Design
IS - 4
ER -