Distributed generation of suffix arrays

Gonzalo Navarro; João Paulo Kitajima; Berthier A. Ribeiro-Neto; Nivio Ziviani

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 1264 (1997), pp. 102–115

DOI: 10.1007/3-540-63220-4_54

Abstract

An algorithm for the distributed computation of suffix arrays for large texts is presented. The parallelism model is a set of sequential tasks that execute in parallel and exchange messages. The underlying architecture is a high-bandwidth network of processors. Our algorithm builds the suffix array by quickly assigning an independent subproblem to each processor and completing the process with a final local sort. We show that, in the average case, the algorithm requires O(b log n) computation time and O(b) communication time, where b is the size of the text held locally by each processor (i.e., the text size n divided by the number of processors r). This is faster than the best known sequential algorithm and improves on previous parallel algorithms for building suffix arrays, in both time complexity and scaling factor.
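To make the idea of "independent subproblems followed by a local sort" concrete, the following is a minimal single-process sketch in Python. It is not the authors' algorithm: it simulates r processors by splitting the suffixes into r lexicographic ranges using splitters taken from a sorted sample, sorting each range on its own, and concatenating the results. The function names, the splitter selection, and the naive string comparisons are all assumptions made for illustration, and the sketch does not achieve the complexity bounds stated in the abstract.

```python
from bisect import bisect_right


def naive_suffix_array(text):
    """Reference construction: sort suffix start positions by suffix."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def partitioned_suffix_array(text, r):
    """Simulate r processors (hypothetical scheme, not the paper's method).

    1. Choose up to r-1 splitter suffixes from a sorted sample.
    2. Route every suffix to the bucket whose lexicographic range holds it.
    3. Sort each bucket independently (the "final local sorting").
    4. Concatenate the buckets in range order.
    """
    n = len(text)
    if n == 0:
        return []

    # Assumption: an evenly spaced sample of suffixes stands in for
    # whatever splitter-selection step a real distributed algorithm uses.
    sample = sorted(text[i:] for i in range(0, n, max(1, n // r)))
    step = max(1, len(sample) // r)
    splitters = sample[step::step][: r - 1]

    # Bucket k receives suffixes s with splitters[k-1] <= s < splitters[k].
    buckets = [[] for _ in range(r)]
    for i in range(n):
        buckets[bisect_right(splitters, text[i:])].append(i)

    # Each bucket is an independent subproblem: sort it locally.
    result = []
    for bucket in buckets:
        bucket.sort(key=lambda i: text[i:])
        result.extend(bucket)
    return result


if __name__ == "__main__":
    text = "abracadabra"
    print(partitioned_suffix_array(text, 3))
```

Because the buckets cover disjoint, ordered lexicographic ranges, concatenating the locally sorted buckets yields the same array as sorting all suffixes at once. In a real distributed setting each bucket would live on a different processor, and the routing step is where communication cost arises.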

Cite

Navarro, G., Kitajima, J. P., Ribeiro-Neto, B. A., & Ziviani, N. (1997). Distributed generation of suffix arrays. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1264, pp. 102–115). Springer Verlag. https://doi.org/10.1007/3-540-63220-4_54
