Fast and accurate genome anchoring using fuzzy hash maps

2Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Although hash-based approaches to sequence alignment and genome assembly are long established, their utility is predicated on the rapid identification of exact k-mers from a hash-map or similar data structure. We describe how a fuzzy hash-map can be applied to quickly and accurately align a prokaryotic genome to the reference genome of a related species. Using this technique, a draft genome of Mycoplasma genitalium, sampled at 1X coverage, was accurately anchored against the genome of Mycoplasma pneumoniae. The fuzzy approach to alignment, ordered and orientated more than 65% of the reads from the draft genome in under 10 seconds, with an error rate of <1.5%. Without sacrificing execution speed, fuzzy hash-maps also provide a mechanism for error tolerance and variability in k-mer centric sequence alignment and assembly applications. © 2011 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Healy, J., & Chambers, D. (2011). Fast and accurate genome anchoring using fuzzy hash maps. In Advances in Intelligent and Soft Computing (Vol. 93, pp. 149–156). https://doi.org/10.1007/978-3-642-19914-1_21

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free