Abstract
Regular languages are widely used in NLP today in spite of their shortcomings. Efficient algorithms are therefore needed that can reliably learn these languages and that, in realistic applications, use only positive samples. These languages are not learnable under traditional distribution-free criteria. We claim that an appropriate learning framework is PAC learning where the distributions are constrained to be generated by a class of stochastic automata whose support equals the target concept. We discuss how this relates to other learning paradigms. We then present a simple learning algorithm for regular languages, and a self-contained proof that it learns according to this partially distribution-free criterion.
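To make the setting concrete, the following is a minimal sketch (not the paper's algorithm) of the kind of distribution the abstract describes: a probabilistic deterministic finite automaton whose support is exactly a target regular language, here assumed for illustration to be a*b. A learner in this framework would see only strings drawn from such a device.

```python
import random
import re

# Hypothetical example PDFA: each state maps a symbol to
# (transition probability, next state); STOP gives the probability
# of halting in each state.  The support of the induced string
# distribution is the regular language a*b.
PDFA = {
    0: {"a": (0.6, 0), "b": (0.4, 1)},  # keep emitting a's, or emit b and move on
    1: {},                              # accepting state: no outgoing transitions
}
STOP = {0: 0.0, 1: 1.0}                 # stopping probability per state

def sample(rng):
    """Draw one positive example from the PDFA's distribution."""
    state, out = 0, []
    while rng.random() >= STOP[state]:
        r, acc = rng.random(), 0.0
        for sym, (p, nxt) in PDFA[state].items():
            acc += p
            if r < acc:
                out.append(sym)
                state = nxt
                break
    return "".join(out)

rng = random.Random(0)
samples = [sample(rng) for _ in range(20)]
```

Every string drawn this way matches a*b, so the sample is guaranteed to consist of positive examples of the target concept, which is the input assumption of the learning criterion discussed in the paper.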
Clark, A., & Thollard, F. (2004). Partially distribution-free learning of regular languages from positive samples. In COLING 2004 - Proceedings of the 20th International Conference on Computational Linguistics. Association for Computational Linguistics (ACL). https://doi.org/10.3115/1220355.1220368