Learning Unions of k-Testable Languages

Alexis Linard; Colin de la Higuera; Frits Vaandrager

Conference Proceedings

Learning Unions of k-Testable Languages

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11417 LNCS 328-339

DOI: 10.1007/978-3-030-13435-8_24

3Citations

4Readers

Get full text

Abstract

A classical problem in grammatical inference is to identify a language from a set of examples. In this paper, we address the problem of identifying a union of languages from examples that belong to several different unknown languages. Indeed, decomposing a language into smaller pieces that are easier to represent should make learning easier than aiming for a too generalized language. In particular, we consider k-testable languages in the strict sense (k-TSS). These are defined by a set of allowed prefixes, infixes (sub-strings) and suffixes that words in the language may contain. We establish a Galois connection between the lattice of all languages over alphabet and the lattice of k-TSS languages over. We also define a simple metric on k-TSS languages. The Galois connection and the metric allow us to derive an efficient algorithm to learn the union of k-TSS languages. We evaluate our algorithm on an industrial dataset and thus demonstrate the relevance of our approach.

Author supplied keywords

Cite

CITATION STYLE

APA

Linard, A., de la Higuera, C., & Vaandrager, F. (2019). Learning Unions of k-Testable Languages. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11417 LNCS, pp. 328–339). Springer Verlag. https://doi.org/10.1007/978-3-030-13435-8_24

Learning Unions of k-Testable Languages

Abstract

Author supplied keywords

Cite

Register to see more suggestions