Type inference for unique pattern matching

Stijn Vansummeren

Journal ArticleOPEN ACCESS

Type inference for unique pattern matching

Vansummeren S

ACM Transactions on Programming Languages and Systems (2006) 28(3) 389-428

DOI: 10.1145/1133651.1133652

19Citations

6Readers

Abstract

Regular expression patterns provide a natural, declarative way to express constraints on semistructured data and to extract relevant information from it. Indeed, it is a core feature of the programming language Perl, surfaces in various UNIX tools such as sad and awk, and has recently been proposed in the context of the XML programming language XDuce. Since regular expressions can be ambiguous in general, different disambiguation policies have been proposed to get a unique matching strategy. We formally define the matching semantics under both (1) the POSIX, and (2) the first and longest match disambiguation strategies. We show that the generally accepted method of defining the longest match in terms of the first match and recursion does not conform to the natural notion of longest match. We continue by solving the type inference problem for both disambiguation strategies, which consists of calculating the set of all subparts of input values a subexpression can match under the given policy. © 2006 ACM.

Author supplied keywords

Cite

CITATION STYLE

APA

Vansummeren, S. (2006). Type inference for unique pattern matching. ACM Transactions on Programming Languages and Systems, 28(3), 389–428. https://doi.org/10.1145/1133651.1133652

Type inference for unique pattern matching

Abstract

Author supplied keywords

Cite

Register to see more suggestions