Is this the end of the gold standard? A straightforward reference-less grammatical error correction metric

7Citations
Citations of this article
81Readers
Mendeley users who have this article in their library.

Abstract

It is difficult to rank and evaluate the performance of grammatical error correction (GEC) systems, as a sentence can be rewritten in numerous correct ways. A number of GEC metrics have been used to evaluate proposed GEC systems; however, each system relies on either a comparison with one or more reference texts-in what is known as the gold standard for reference-based metrics-or a separate annotated dataset to fine-tune the reference-less metric. Reference-based systems have a low correlation with human judgement, cannot capture all the ways in which a sentence can be corrected, and require substantial work to develop a test dataset. We propose a reference-less GEC evaluation system that is strongly correlated with human judgement, solves the issues related to the use of a reference, and does not need another annotated dataset for fine-tuning. The proposed system relies solely on commonly available tools. Additionally, currently available reference-less metrics do not work properly when part of a sentence is repeated as opposed to reference-based metrics. In our proposed system, we look to address issues inherent in reference-less metrics and reference-based metrics.

Cite

CITATION STYLE

APA

Islam, M. A., & Magnani, E. (2021). Is this the end of the gold standard? A straightforward reference-less grammatical error correction metric. In EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 3009–3015). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.emnlp-main.239

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free