DSAP: Deep-sequencing small RNA analysis pipeline

Po Jung Huang, Yi Chung Liu, Chi Ching Lee, Wei Chen Lin, Richie Ruei Chi Gan, Ping Chiang Lyu, Petrus Tang

Research output: Contribution to journalArticlepeer-review

80 Citations (Scopus)


DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log2-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.

Original languageEnglish
Article numbergkq392
Pages (from-to)W385-W391
JournalNucleic acids research
Issue numberSUPPL. 2
Publication statusPublished - 2010 May 15

All Science Journal Classification (ASJC) codes

  • Genetics


Dive into the research topics of 'DSAP: Deep-sequencing small RNA analysis pipeline'. Together they form a unique fingerprint.

Cite this