The AMU system in the CoNLL-2014 shared task: Grammatical error correction by data-intensive and feature-rich statistical machine translation

Marcin Junczys-Dowmunt; Roman Grundkiewicz

Conference Proceedings

The AMU system in the CoNLL-2014 shared task: Grammatical error correction by data-intensive and feature-rich statistical machine translation

CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings of the Shared Task (2014) 25-33

DOI: 10.3115/v1/w14-1703

63Citations

111Readers

Get full text

Abstract

Statistical machine translation toolkits like Moses have not been designed with grammatical error correction in mind. In order to achieve competitive results in this area, it is not enough to simply add more data. Optimization procedures need to be customized, task-specific features should be introduced. Only then can the decoder take advantage of relevant data. We demonstrate the validity of the above claims by combining web-scale language models and large-scale error-corrected texts with parameter tuning according to the task metric and correction-specific features. Our system achieves a result of 35.0% F0.5 on the blind CoNLL-2014 test set, ranking on third place. A similar system, equipped with identical models but without tuned parameters and specialized features, stagnates at 25.4%.

Cite

CITATION STYLE

APA

Junczys-Dowmunt, M., & Grundkiewicz, R. (2014). The AMU system in the CoNLL-2014 shared task: Grammatical error correction by data-intensive and feature-rich statistical machine translation. In CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings of the Shared Task (pp. 25–33). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-1703

The AMU system in the CoNLL-2014 shared task: Grammatical error correction by data-intensive and feature-rich statistical machine translation

Abstract

Cite

Register to see more suggestions