Fishing in read collections: Memory efficient indexing for sequence assembly

Vladimír Boža; Jakub Jursa; Broňa Brejová; Tomáš Vinař

Conference Proceedings

Fishing in read collections: Memory efficient indexing for sequence assembly

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9309 188-198

DOI: 10.1007/978-3-319-23826-5_19

0Citations

4Readers

Get full text

Abstract

In this paper, we present a memory efficient index for storing a large set of DNA sequencing reads. The index allows us to quickly retrieve the set of reads containing a certain query k-mer. Instead of the usual approach of treating each read as a separate string, we take an advantage of significant overlap between reads and compress the data by aligning the reads to an approximate superstring constructed specifically for this purpose in combination with several succint data structures.

Cite

CITATION STYLE

APA

Boža, V., Jursa, J., Brejová, B., & Vinař, T. (2015). Fishing in read collections: Memory efficient indexing for sequence assembly. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9309, pp. 188–198). Springer Verlag. https://doi.org/10.1007/978-3-319-23826-5_19

Fishing in read collections: Memory efficient indexing for sequence assembly

Abstract

Cite

Register to see more suggestions