FSG: Fast string graph construction for de novo assembly of reads data

1Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The string graph for a collection of next-generation reads is a lossless data representation that is fundamental for de novo assemblers based on the overlap-layout-consensus paradigm. In this paper, we explore a novel approach to compute the string graph, based on the FMindex and Burrows-Wheeler Transform (BWT). We describe a simple algorithm that uses only the FM-index representation of the collection of reads to construct the string graph, without accessing the input reads. Our algorithm has been integrated into the SGA assembler as a standalone module to construct the string graph. The new integrated assembler has been assessed on a standard benchmark, showing that FSG is significantly faster than SGA while maintaining a moderate use of main memory, and showing practical advantages in running FSG on multiple threads.

Cite

CITATION STYLE

APA

Bonizzoni, P., Della Vedova, G., Pirola, Y., Previtali, M., & Rizzi, R. (2016). FSG: Fast string graph construction for de novo assembly of reads data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9683, pp. 27–39). Springer Verlag. https://doi.org/10.1007/978-3-319-38782-6_3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free