IDBA - A practical iterative De Bruijn graph De Novo assembler

Yu Peng; Henry C.M. Leung; S. M. Yiu; Francis Y.L. Chin

Conference Proceedings

IDBA - A practical iterative De Bruijn graph De Novo assembler

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6044 LNBI 426-440

DOI: 10.1007/978-3-642-12683-3_28

208Citations

273Readers

Get full text

Abstract

The de Bruijn graph assembly approach breaks reads into k-mers before assembling them into contigs. The string graph approach forms contigs by connecting two reads with k or more overlapping nucleotides. Both approaches must deal with the following problems: false-positive vertices, due to erroneous reads; gap problem, due to non-uniform coverage; branching problem, due to erroneous reads and repeat regions. A proper choice of k is crucial but for single k there is always a trade-off: a small k favors the situation of erroneous reads and non-uniform coverage, and a large k favors short repeat regions. We propose an iterative de Bruijn graph approach iterating from small to large k exploring the advantages of the in between values. Our IDBA outperforms the existing algorithms by constructing longer contigs with similar accuracy and using less memory, both with real and simulated data. The running time of the algorithm is comparable to existing algorithms. © Springer-Verlag Berlin Heidelberg 2010.

Author supplied keywords

Cite

CITATION STYLE

APA

Peng, Y., Leung, H. C. M., Yiu, S. M., & Chin, F. Y. L. (2010). IDBA - A practical iterative De Bruijn graph De Novo assembler. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6044 LNBI, pp. 426–440). https://doi.org/10.1007/978-3-642-12683-3_28

IDBA - A practical iterative De Bruijn graph De Novo assembler

Abstract

Author supplied keywords

Cite

Register to see more suggestions