Efficient Signal Inclusion With Genomic Applications

X. Jessie Jeng, Teng Zhang, Jung Ying Tzeng

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

This article addresses the challenge of efficiently capturing a high proportion of true signals for subsequent data analyses when sample sizes are relatively limited with respect to data dimension. We propose the signal missing rate (SMR) as a new measure for false-negative control to account for the variability of false-negative proportion. Novel data-adaptive procedures are developed to control SMR without incurring many unnecessary false positives under dependence. We justify the efficiency and adaptivity of the proposed methods via theory and simulation. The proposed methods are applied to GWAS on human height to effectively remove irrelevant single nucleotide polymorphisms (SNPs) while retaining a high proportion of relevant SNPs for subsequent polygenic analysis. Supplementary materials for this article are available online.

Original languageEnglish
Pages (from-to)1787-1799
Number of pages13
JournalJournal of the American Statistical Association
Volume114
Issue number528
DOIs
Publication statusPublished - 2019 Oct 2

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint Dive into the research topics of 'Efficient Signal Inclusion With Genomic Applications'. Together they form a unique fingerprint.

Cite this