NGSPERL: A semi-automated framework for large scale next generation sequencing data analysis

Quanhu Sheng, Shilin Zhao, Mingsheng Guo, Yu Shyr

Research output: Article, peer-reviewed

Citations: 1 (Scopus)

Abstract

High-throughput sequencing technologies have been widely used in medical and biological research, especially in cancer biology. With the huge amounts of sequencing data being generated, data analysis has become the bottleneck of the research procedure. We have designed and implemented NGSPERL, a semi-automated, module-based framework for high-throughput sequencing data analysis. Three major analysis pipelines with multiple tasks have been developed for RNA sequencing, exome sequencing, and small RNA sequencing data. Each task was developed as a module. The module uses the output from the previous task as its input parameter to generate the corresponding portable batch system (PBS) script. The PBS scripts can be either submitted to a cluster or run directly, depending on the user's choice. Multiple tasks can also be combined into a single task to simplify the data analysis. Such a flexible framework will significantly automate and simplify the process of large-scale sequencing data analysis.
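
The task chaining described in the abstract can be illustrated with a minimal Python sketch. It is not NGSPERL's actual code or API (NGSPERL itself is written in Perl): the `Task` class, the `pbs_script` helper, the file layout, and the `trim_tool` and `align_tool` commands are all hypothetical names chosen for illustration. The sketch only shows the general idea of a task taking the previous task's output as its input, emitting a PBS script, and being combined with other tasks into one script.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Task:
    """Hypothetical task module; names and fields are illustrative only."""
    name: str
    command: str                      # template with {input} and {output}
    output_suffix: str
    source: Optional["Task"] = None   # previous task whose output feeds this one

    def output_for(self, sample: str) -> str:
        # Assumed layout: one directory per task, one file per sample.
        return f"{self.name}/{sample}{self.output_suffix}"

    def input_for(self, sample: str, raw_files: Dict[str, str]) -> str:
        # The first task reads the raw file; later tasks read upstream output.
        if self.source is None:
            return raw_files[sample]
        return self.source.output_for(sample)

    def command_for(self, sample: str, raw_files: Dict[str, str]) -> str:
        return self.command.format(
            input=self.input_for(sample, raw_files),
            output=self.output_for(sample),
        )


def pbs_script(job_name: str, commands: List[str]) -> str:
    """Wrap shell commands in a minimal PBS header."""
    header = ["#!/bin/bash", f"#PBS -N {job_name}", "#PBS -l nodes=1:ppn=1", ""]
    return "\n".join(header + commands) + "\n"


raw_files = {"sampleA": "raw/sampleA.fastq"}

# trim_tool and align_tool are placeholder commands, not real programs.
trim = Task("trim", "trim_tool {input} {output}", ".trimmed.fastq")
align = Task("align", "align_tool {input} {output}", ".bam", source=trim)

# One script per task and sample.
align_script = pbs_script("align_sampleA", [align.command_for("sampleA", raw_files)])

# Several tasks combined into a single script.
combined = pbs_script(
    "pipeline_sampleA",
    [task.command_for("sampleA", raw_files) for task in (trim, align)],
)
print(combined)
```

In this sketch, a generated script would be written to disk and then either submitted with `qsub` or executed directly with `bash`, mirroring the two modes mentioned in the abstract.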

Original language: English
Pages (from-to): 203-211
Number of pages: 9
Journal: International Journal of Computational Biology and Drug Design
Volume: 8
Issue number: 3
Publication status: Published - 2015

All Science Journal Classification (ASJC) codes

  • Drug Discovery
  • Computer Science Applications
