Fast construction of FM-index for long sequence reads.

Bioinformatics
Abstract

SUMMARY: We present a new method to incrementally construct the FM-index for both short and long sequence reads, up to the size of a genome. It is the first algorithm that can build the index while implicitly sorting the sequences in reverse (complement) lexicographical order, without a separate sorting step. The implementation is among the fastest for indexing short reads and the only one that works in practice for reads averaging kilobases in length.

AVAILABILITY AND IMPLEMENTATION: https://github.com/lh3/ropebwt2

CONTACT: hengli@broadinstitute.org
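
For readers unfamiliar with the data structure named in the summary, the following is a minimal sketch of what an FM-index is and how it answers a pattern-count query. It is not the incremental algorithm of the paper and not the ropebwt2 implementation: it builds the Burrows-Wheeler transform naively by sorting all rotations of a single string, which is only feasible for tiny inputs. The function names `bwt`, `build_fm` and `count` are hypothetical and chosen for illustration only.

```python
from collections import Counter

def bwt(text):
    """Naive BWT (illustration only): sort all rotations of text + '$'
    and take the last column. '$' is assumed absent from text."""
    s = text + "$"
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(r[-1] for r in rotations)

def build_fm(bwt_str):
    """Return (C, occ) for a BWT string.
    C[c]      = number of symbols in bwt_str strictly smaller than c
    occ[c][i] = number of occurrences of c in bwt_str[:i]"""
    counts = Counter(bwt_str)
    C, total = {}, 0
    for c in sorted(counts):
        C[c] = total
        total += counts[c]
    occ = {c: [0] * (len(bwt_str) + 1) for c in counts}
    for i, ch in enumerate(bwt_str):
        for c in counts:
            occ[c][i + 1] = occ[c][i] + (1 if c == ch else 0)
    return C, occ

def count(pattern, C, occ, n):
    """Backward search: count occurrences of pattern in the indexed text.
    n is the length of the BWT string (text length + 1)."""
    lo, hi = 0, n
    for ch in reversed(pattern):
        if ch not in C:
            return 0
        lo = C[ch] + occ[ch][lo]
        hi = C[ch] + occ[ch][hi]
        if lo >= hi:
            return 0
    return hi - lo

# Usage on a toy string
b = bwt("banana")
C, occ = build_fm(b)
n_ana = count("ana", C, occ, len(b))
```

The sketch stores a full occurrence table per symbol for clarity; practical indexes for sequencing reads use compressed or sampled representations instead.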

Year of Publication
2014
Journal
Bioinformatics
Volume
30
Issue
22
Pages
3274-3275
Date Published
2014 Nov 15
ISSN
1367-4811
DOI
10.1093/bioinformatics/btu541
PubMed ID
25107872
PubMed Central ID
PMC4221129
Grant list
GM100233 / GM / NIGMS NIH HHS / United States
U54HG003037 / HG / NHGRI NIH HHS / United States