Abstract
Since the end of the CoNLL-2014 shared task on grammatical error correction (GEC), research into language model (LM) based approaches to GEC has largely stagnated. In this paper, we re-examine LMs in GEC and show that it is entirely possible to build a simple system that not only requires minimal annotated data (∼1000 sentences), but is also fairly competitive with several state-of-the-art systems. This approach should be of particular interest for languages where very little annotated training data exists, although we also hope to use it as a baseline to motivate future research.
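The core idea of an LM-based GEC system of this kind is to generate candidate corrections (e.g. from small confusion sets) and keep whichever candidate the language model scores highest. The sketch below is illustrative only, assuming a toy bigram scorer and hand-built confusion sets; the actual system in the paper uses a large n-gram LM and richer candidate generation, and all names here are hypothetical.

```python
# Minimal sketch of LM-based error correction: propose candidates from
# confusion sets, rank them with a language model, keep the best.
# Toy bigram "LM" and confusion sets are illustrative stand-ins.
from itertools import product

# Toy bigram counts used as a stand-in for LM log-probabilities.
BIGRAM_COUNTS = {
    ("<s>", "i"): 60, ("i", "am"): 50, ("am", "happy"): 30,
    ("i", "is"): 1, ("is", "happy"): 5,
}

def score(tokens):
    """Higher is better: summed bigram counts (proxy for LM score)."""
    padded = ["<s>"] + tokens
    return sum(BIGRAM_COUNTS.get(pair, 0) for pair in zip(padded, padded[1:]))

# Hand-built confusion sets, e.g. for subject-verb agreement errors.
CONFUSIONS = {"is": {"is", "am", "are"}, "am": {"is", "am", "are"}}

def correct(sentence):
    tokens = sentence.lower().split()
    # Alternatives per position (just the word itself if no confusion set).
    options = [sorted(CONFUSIONS.get(t, {t})) for t in tokens]
    # Enumerate every candidate sentence and keep the highest-scoring one.
    best = max(product(*options), key=lambda cand: score(list(cand)))
    return " ".join(best)

print(correct("I is happy"))  # the toy LM prefers "i am happy"
```

In practice the candidate space is pruned rather than fully enumerated, and the LM score is a real (log-)probability, but the ranking principle is the same.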
Citation
Bryant, C., & Briscoe, T. (2018). Language model based grammatical error correction without annotated training data. In Proceedings of the 13th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2018) at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2018) (pp. 247–253). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-0529