Truncated DAWGs and their application to minimal absent word problem

2Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The directed acyclic word graph (DAWG) of a string y is the smallest (partial) DFA which recognizes all suffixes of y and has O(n) nodes and edges. Na et al. [11] proposed k-truncated suffix tree which is a compressed trie that represents substrings of a string whose length up to k. In this paper, we present a new data structure called k-truncated DAWGs, which can be obtained by pruning the DAWGs. We show that the size complexity of the k-truncated DAWG of a string y of length n is O(min{{n,kz}) which is equal to the truncated suffix tree’s one, where z is the size of LZ77 factorization of y. We also present an O(n log σ) time and O(min{{n,kz}) space algorithm for constructing the k-truncated DAWG of y, where σ is the alphabet size. As an application of the truncated DAWGs, we show that the set MAWk(y) of all minimal absent words of y whose length is smaller than or equal to k can be computed by using k-truncated DAWG of y in O(min{{n,kz}) + |MAWk(y)|) time and O(min{{n,kz}) working space.

Cite

CITATION STYLE

APA

Fujishige, Y., Takagi, T., & Hendrian, D. (2018). Truncated DAWGs and their application to minimal absent word problem. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11147 LNCS, pp. 139–152). Springer Verlag. https://doi.org/10.1007/978-3-030-00479-8_12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free