Integrative gene set analysis of multi-platform data with sample heterogeneity

Jun Hu, Jung Ying Tzeng

研究成果: Article同行評審

9 引文 斯高帕斯(Scopus)


Motivation: Gene set analysis is a popular method for large-scale genomic studies. Because genes that have common biological features are analyzed jointly, gene set analysis often achieves better power and generates more biologically informative results. With the advancement of technologies, genomic studies with multi-platform data have become increasingly common. Several strategies have been proposed that integrate genomic data from multiple platforms to perform gene set analysis. To evaluate the performances of existing integrative gene set methods under various scenarios, we conduct a comparative simulation analysis based on The Cancer Genome Atlas breast cancer dataset. Results: We find that existing methods for gene set analysis are less effective when sample heterogeneity exists. To address this issue, we develop three methods for multi-platform genomic data with heterogeneity: two non-parametric methods, multi-platform Mann-Whitney statistics and multi-platform outlier robust T-statistics, and a parametric method, multi-platform likelihood ratio statistics. Using simulations, we show that the proposed multi-platform Mann-Whitney statistics method has higher power for heterogeneous samples and comparable performance for homogeneous samples when compared with the existing methods. Our real data applications to two datasets of The Cancer Genome Atlas also suggest that the proposed methods are able to identify novel pathways that are missed by other strategies.

頁(從 - 到)1501-1507
出版狀態Published - 2014 6月 1

All Science Journal Classification (ASJC) codes

  • 統計與概率
  • 生物化學
  • 分子生物學
  • 電腦科學應用
  • 計算機理論與數學
  • 計算數學


深入研究「Integrative gene set analysis of multi-platform data with sample heterogeneity」主題。共同形成了獨特的指紋。