Using cascading Bloom filters to improve the memory usage for de Brujin graphs

50Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: De Brujin graphs are widely used in bioinformatics for processing next-generation sequencing data. Due to a very large size of NGS datasets, it is essential to represent de Bruijn graphs compactly, and several approaches to this problem have been proposed recently. Results: In this work, we show how to reduce the memory required by the data structure of Chikhi and Rizk (WABI'12) that represents de Brujin graphs using Bloom filters. Our method requires 30% to 40% less memory with respect to their method, with insignificant impact on construction time. At the same time, our experiments showed a better query time compared to the method of Chikhi and Rizk.Conclusion: The proposed data structure constitutes, to our knowledge, currently the most efficient practical representation of de Bruijn graphs. © 2014 Salikhov et al.; licensee BioMed Central Ltd.

Cite

CITATION STYLE

APA

Salikhov, K., Sacomoto, G., & Kucherov, G. (2014). Using cascading Bloom filters to improve the memory usage for de Brujin graphs. Algorithms for Molecular Biology, 9(1). https://doi.org/10.1186/1748-7188-9-2

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free