Unsupervised grammar inference using the minimum description length principle

1Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Context Free Grammars (CFGs) are widely used in programming language descriptions, natural language processing, compilers, and other areas of software engineering where there is a need for describing the syntactic structures of programs. Grammar inference (GI) is the induction of CFGs from sample programs and is a challenging problem. We describe an unsupervised GI approach which uses simplicity as the criterion for directing the inference process and beam search for moving from a complex to a simpler grammar. We use several operators to modify a grammar and use the Minimum Description Length (MDL) Principle to favor simple and compact grammars. The effectiveness of this approach is shown by a case study of a domain specific language. The experimental results show that an accurate grammar can be inferred in a reasonable amount of time. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Sapkota, U., Bryant, B. R., & Sprague, A. (2012). Unsupervised grammar inference using the minimum description length principle. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7376 LNAI, pp. 141–153). https://doi.org/10.1007/978-3-642-31537-4_12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free