Searching for supermaximal repeats in large DNA sequences

Chen Na Lian; Mihail Halachev; Nematollaah Shiri

Journal Article

Searching for supermaximal repeats in large DNA sequences

Communications in Computer and Information Science (2008) 13 87-101

DOI: 10.1007/978-3-540-70600-7_7

4Citations

4Readers

Get full text

Abstract

We study the problem of finding supermaximal repeats in large DNA sequences. For this, we propose an algorithm called SMR which uses an auxiliary index structure (POL), which is derived from and replaces the suffix tree index STTD64 [1]. The results of our numerous experiments using the 24 human chromosomes data indicate that SMR outperforms the solution provided as part of the Vmatch [2] software tool. In searching for supermaximal repeats of size at least 10 bases, SMR is twice faster than Vmatch; for a minimum length of 25 bases, SMR is 7 times faster; and for repeats of length at least 200, SMR is about 9 times faster. We also study the cost of POL in terms of time and space requirements. © Springer-Verlag Berlin Heidelberg 2008.

Author supplied keywords

Cite

CITATION STYLE

APA

Lian, C. N., Halachev, M., & Shiri, N. (2008). Searching for supermaximal repeats in large DNA sequences. Communications in Computer and Information Science, 13, 87–101. https://doi.org/10.1007/978-3-540-70600-7_7

Searching for supermaximal repeats in large DNA sequences

Abstract

Author supplied keywords

Cite

Register to see more suggestions