The Burrows-Wheeler Transform is a text permutation that has revolutionized the fields of pattern matching and text compression, bridging the gap existing between the two. In this paper we approach the BWT-construction problem generalizing a well-known algorithm—based on backward search and dynamic strings manipulation—to work in a context-wise fashion, using automata on words. Let n, σ, and H k be the text length, the alphabet size, and the k-th order empirical entropy of the text, respectively. Moreover, let H ∗ k = min{H k +1, [log σ]}. Under the word RAM model with word size w ∈ Θ(log n), our algorithm builds theBWTin averageO(nH ∗ k) time using nH ∗ k +o(nH ∗ k) bits of space,where k = log σ (n/ log 2 n) − 1. We experimentally show that our algorithm has very good performances (essentially linear time) on DNA sequences, using about 2.6 bits per input symbol in RAM.
CITATION STYLE
Policriti, A., Gigante, N., & Prezza, N. (2015). Average linear time and compressed space construction of the burrows-wheeler transform. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8977, pp. 587–598). Springer Verlag. https://doi.org/10.1007/978-3-319-15579-1_46
Mendeley helps you to discover research relevant for your work.