Translation of regular expression with lookahead into finite state automaton

ISSN: 02896540

Abstract

Most conventional implementations of regular expressions are based on backtracking. Such implementations are slow in the worst case, which motivates a better matching algorithm. However, it is nontrivial to provide an efficient matching algorithm that handles practical extensions, including submatch addressing. This paper studies regular expressions with lookaheads and negative lookaheads, abbreviated to REwLA. First, we propose a transformation from a REwLA of size m to a deterministic finite automaton with O(2^{2m}) states. Next, we consider weighted regular expressions, which enable us to compute submatch addresses. We propose a transformation from a weighted REwLA of size m to a weighted nondeterministic finite automaton with O(2^{2m}) states.
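To make the semantics of lookahead concrete, the following is a minimal Python sketch of a non-backtracking REwLA matcher. It is not the automaton construction proposed in the paper: it swaps in a simpler technique, memoised evaluation of "end position sets", which runs in time polynomial in the sizes of the expression and the input. All class and function names (`Chr`, `Cat`, `Alt`, `Star`, `Look`, `match_ends`, `fullmatch`) are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from functools import lru_cache


# Hypothetical AST for REwLA; frozen dataclasses are hashable,
# so nodes can be used as memoisation keys.
@dataclass(frozen=True)
class Chr:
    c: str


@dataclass(frozen=True)
class Cat:
    l: object
    r: object


@dataclass(frozen=True)
class Alt:
    l: object
    r: object


@dataclass(frozen=True)
class Star:
    r: object


@dataclass(frozen=True)
class Look:
    r: object
    neg: bool = False  # True for a negative lookahead


def match_ends(r, s):
    """Return the set of positions j such that r matches s[0:j]."""

    @lru_cache(maxsize=None)
    def ends(node, i):
        # Set of positions j such that node matches s[i:j].
        if isinstance(node, Chr):
            hit = i < len(s) and s[i] == node.c
            return frozenset({i + 1}) if hit else frozenset()
        if isinstance(node, Cat):
            return frozenset(k for j in ends(node.l, i) for k in ends(node.r, j))
        if isinstance(node, Alt):
            return ends(node.l, i) | ends(node.r, i)
        if isinstance(node, Star):
            # Closure: every position reachable by repeating the body.
            seen, todo = {i}, [i]
            while todo:
                j = todo.pop()
                for k in ends(node.r, j):
                    if k not in seen:
                        seen.add(k)
                        todo.append(k)
            return frozenset(seen)
        if isinstance(node, Look):
            # A lookahead consumes no input: it only tests whether the
            # body matches some prefix of the remaining input.
            ok = bool(ends(node.r, i))
            return frozenset({i}) if ok != node.neg else frozenset()
        raise TypeError(f"unknown node: {node!r}")

    return ends(r, 0)


def fullmatch(r, s):
    """True if r matches the whole string s."""
    return len(s) in match_ends(r, s)


a, b = Chr("a"), Chr("b")
any_ab = Star(Alt(a, b))
pos = Cat(Look(Cat(a, b)), any_ab)   # (?=ab)(a|b)*
neg = Cat(Look(b, neg=True), any_ab) # (?!b)(a|b)*

print(fullmatch(pos, "abba"))  # True
print(fullmatch(pos, "baab"))  # False
print(fullmatch(neg, "abba"))  # True
print(fullmatch(neg, "baab"))  # False
```

Because each (node, position) pair is evaluated at most once, the sketch avoids the repeated work of a backtracking matcher. It does not, however, yield a finite automaton or handle submatch addressing.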

Cite

Morihata, A. (2012). Translation of regular expression with lookahead into finite state automaton. Computer Software, 29(1), 147–158.
