Sample size calculation for differential expression analysis of RNA-seq data under Poisson distribution

Research output: Contribution to journalArticlepeer-review

16 Citations (Scopus)

Abstract

Sample size determination is an important issue in the experimental design of biomedical research. Because of the complexity of RNA-seq experiments, however, the field currently lacks a sample size method widely applicable to differential expression studies utilising RNA-seq technology. In this report, we propose several methods for sample size calculation for single-gene differential expression analysis of RNA-seq data under Poisson distribution. These methods are then extended to multiple genes, with consideration for addressing the multiple testing problem by controlling false discovery rate. Moreover, most of the proposed methods allow for closed-form sample size formulas with specification of the desired minimum fold change and minimum average read count, and thus are not computationally intensive. Simulation studies to evaluate the performance of the proposed sample size formulas are presented; the results indicate that our methods work well, with achievement of desired power. Finally, our sample size calculation methods are applied to three real RNA-seq data sets.

Original languageEnglish
Pages (from-to)358-375
Number of pages18
JournalInternational Journal of Computational Biology and Drug Design
Volume6
Issue number4
DOIs
Publication statusPublished - 2013

All Science Journal Classification (ASJC) codes

  • Drug Discovery
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Sample size calculation for differential expression analysis of RNA-seq data under Poisson distribution'. Together they form a unique fingerprint.

Cite this