Application of a superword array in genome assembly

23Citations
Citations of this article
33Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

We introduce a data structure called a superword array for finding quickly matches between DNA sequences. The superword array possesses some desirable features of the lookup table and suffix array. We describe simple algorithms for constructing and using a superword array to find pairs of sequences that share a unique superword. The algorithms are implemented in a genome assembly program called PCAP.REP for computation of overlaps between reads. Experimental results produced by PCAP.REP and PCAP on a whole-genome dataset show that PCAP.REP produced a more accurate and contiguous assembly than PCAP. © The Author 2006. Published by Oxford University Press. All rights reserved.

Cite

CITATION STYLE

APA

Huang, X., Yang, S. P., Chinwalla, A. T., Hillier, L. D. W., Minx, P., Mardis, E. R., & Wilson, R. K. (2006). Application of a superword array in genome assembly. Nucleic Acids Research, 34(1), 201–205. https://doi.org/10.1093/nar/gkj419

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free