摘要
The suffix array and its variants are text-indexing data structures that have become indispensable in the field of bioinformatics. With the uninitiated in mind, we provide an accessible exposition of the SA-IS algorithm, which is the state of the art in suffix array construction.We also describe DisLex, a technique that allows standard suffix array construction algorithms to create modified suffix arrays designed to enable a simple form of inexact matching needed to support 'spaced seeds' and 'subset seeds' used in many biological applications.
原文 | English |
---|---|
文章編號 | bbt081 |
頁(從 - 到) | 138-154 |
頁數 | 17 |
期刊 | Briefings in bioinformatics |
卷 | 15 |
發行號 | 2 |
DOIs | |
出版狀態 | Published - 2014 三月 |
All Science Journal Classification (ASJC) codes
- Information Systems
- Molecular Biology