Analysis of maximal repetitions in strings

Abstract

The cornerstone of any algorithm computing all repetitions in strings of length n in O(n) time is the fact that the number of maximal repetitions (runs) is linear. Therefore, the most important part of the analysis of the running time of such algorithms is counting the number of runs. Kolpakov and Kucherov [FOCS'99] proved it to be cn but could not provide any value for c. Recently, Rytter [STACS'06] proved that c ≤ 5. His analysis has been improved by Puglisi et al. to obtain 3.48 and by Rytter to 3.44 (both submitted). The conjecture of Kolpakov and Kucherov, supported by computations, is that c = 1. Here we dramatically improve the previous results by proving that c < 1.6 and show how it could be improved by computer verification down to 1.18 or less. While the conjecture may be very difficult to prove, we believe that our work provides a good approximation for all practical purposes. For the stronger result concerning the linearity of the sum of exponents, we give the first explicit bound: 5.6n. Kolpakov and Kucherov did not have any, and Rytter considered "unsatisfactory" the bound that could be deduced from his proof. Our bound could also be improved by computer verification down to 2.9n or less. © Springer-Verlag Berlin Heidelberg 2007.
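To make the two quantities in the abstract concrete, the following is a minimal brute-force sketch that enumerates the runs of a short string and sums their exponents. It is not the O(n) algorithm the abstract refers to, and the names maximal_repetitions and sum_of_exponents are hypothetical helpers introduced here for illustration. It assumes the usual definition: a run is an interval with smallest period p and length at least 2p whose periodicity cannot be extended to the left or right, and its exponent is its length divided by p.

```python
from fractions import Fraction


def maximal_repetitions(w):
    """Return {(start, end): smallest_period} for every run in w (end inclusive).

    Brute force, roughly O(n^2) time; illustration only.
    """
    n = len(w)
    runs = {}
    # Periods are tried in increasing order, so the first period recorded
    # for an interval is its smallest one.
    for p in range(1, n // 2 + 1):
        k = 0
        while k + p < n:
            if w[k] != w[k + p]:
                k += 1
                continue
            a = k  # start of a maximal stretch with w[k] == w[k + p]
            while k + p < n and w[k] == w[k + p]:
                k += 1
            # w[a .. k+p-1] has period p and cannot be extended either way;
            # it is a repetition only if it spans at least two periods.
            if k - a >= p:
                runs.setdefault((a, k + p - 1), p)
    return runs


def sum_of_exponents(runs):
    """Sum of length/period over all runs, as an exact fraction."""
    return sum(Fraction(end - start + 1, p) for (start, end), p in runs.items())


if __name__ == "__main__":
    w = "aabaab"
    runs = maximal_repetitions(w)
    print(len(runs), sum_of_exponents(runs), len(w))
```

For small inputs, len(runs) and sum_of_exponents(runs) can be compared against the cn and 5.6n bounds discussed in the abstract, with n = len(w).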

Crochemore, M., & Ilie, L. (2007). Analysis of maximal repetitions in strings. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4708 LNCS, pp. 465–476). Springer Verlag. https://doi.org/10.1007/978-3-540-74456-6_42
