Linear-time construction of compressed suffix arrays using o(n log n)-bit working space for large alphabets

Abstract

The suffix array is a fundamental index data structure in string algorithms and bioinformatics, and the compressed suffix array (CSA) and the FM-index are its compressed versions. Many algorithms for constructing these index data structures have been developed. Recently, Hon et al. [11] proposed a construction algorithm using O(n · log log |Σ|) time and O(n log |Σ|)-bit working space, which is the fastest algorithm using O(n log |Σ|)-bit working space. In this paper we give an efficient algorithm to construct the index data structures for large alphabets. Our algorithm constructs the suffix array, the CSA, and the FM-index using O(n) time and O(n log |Σ| · log_{|Σ|}^α n)-bit working space, where α = log_3 2. Our algorithm takes less time and more space than Hon et al.'s algorithm. Our algorithm uses the least working space among alphabet-independent linear-time algorithms. © Springer-Verlag Berlin Heidelberg 2005.
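
The title states the working space as o(n log n) bits, while the abstract gives it in terms of |Σ|. The short derivation below is a sketch of how the two forms relate. It is not taken from the paper, and it assumes the usual reading of log_{|Σ|}^α n as (log_{|Σ|} n)^α with 2 ≤ |Σ| ≤ n.

```latex
% Assumption: \log_{|\Sigma|}^{\alpha} n means (\log_{|\Sigma|} n)^{\alpha}, and 2 \le |\Sigma| \le n.
\begin{align*}
n \log|\Sigma| \cdot \left(\log_{|\Sigma|} n\right)^{\alpha}
  &= n \log|\Sigma| \cdot \left(\frac{\log n}{\log|\Sigma|}\right)^{\alpha}
   = n \,(\log n)^{\alpha}\,(\log|\Sigma|)^{1-\alpha}, \\[4pt]
\frac{n \,(\log n)^{\alpha}\,(\log|\Sigma|)^{1-\alpha}}{n \log n}
  &= \left(\frac{\log|\Sigma|}{\log n}\right)^{1-\alpha},
  \qquad \alpha = \log_3 2 \approx 0.631 .
\end{align*}
```

Since 1 − α > 0, the ratio in the second line tends to 0 whenever log |Σ| = o(log n), which gives the o(n log n)-bit bound of the title in that regime. The first line also shows that the bound exceeds Hon et al.'s O(n log |Σ|) bits by a factor of (log_{|Σ|} n)^α ≥ 1, consistent with the abstract's remark that the algorithm uses more space.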

Cite

Na, J. C. (2005). Linear-time construction of compressed suffix arrays using o(n log n)-bit working space for large alphabets. In Lecture Notes in Computer Science (Vol. 3537, pp. 57–67). Springer Verlag. https://doi.org/10.1007/11496656_6
