Motivation High-throughput genomic data often contain unexpected information that can be mined for alternative applications. Despite the rise of high-throughput sequencing, Illumina genotyping arrays remain a driving force in large scale genetic and epidemiology studies. By processing and analyzing genotyping data of over 100,000 samples genotyped on Illumina genotyping arrays, we discovered evidence that indicates that mitochondrial heteroplasmy can be estimated from the fluorescence intensity data of the array. To verify our hypothesis, we conducted a sequencing validation study. Result Mitochondrial DNA targeted sequencing was performed on three samples that had been genotyped using the Illumina exome genotyping array. In each sample chosen, one heteroplasmy target was identified from the genotyping array, and sequencing data verified all three putative heteroplasmic sites. The estimated heteroplasmy level difference between that estimated from the genotyping fluorescence intensity and that directly measured from sequencing was 3.2% on average. Our analysis showed that an Illumina genotyping array can accurately and reliably estimate high-level heteroplasmy (> 40%); however, intensity data from a genotyping array is not suitable for estimating low level heteroplasmy (< 25%).
All Science Journal Classification (ASJC) codes
- Molecular Medicine
- Molecular Biology
- Cell Biology