Abstract
This paper explores the generation of artificial errors for correcting grammatical mistakes made by learners of English as a second language. Artificial errors are injected into a set of error-free sentences in a probabilistic manner using statistics from a corpus. Unlike previous approaches, we use linguistic information to derive error generation probabilities and build corpora to correct several error types, including open-class errors. In addition, we also analyse the variables involved in the selection of candidate sentences. Experiments using the NUCLE corpus from the CoNLL 2013 shared task reveal that: 1) training on artificially created errors improves precision at the expense of recall and 2) different types of linguistic information are better suited for correcting different error types.
Cite
CITATION STYLE
Felice, M., & Yuan, Z. (2014). Generating artificial errors for grammatical error correction. In EACL 2014 - 14th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Student Research Workshop (pp. 116–126). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/e14-3013
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.