In this paper, we introduce the Greek version of the automatic annotation tool ERRANT (Bryant et al., 2017), which we named ELERRANT. ERRANT functions as a rule-based error type classifier and was used as the main evaluation tool of the systems participating in the BEA-2019 (Bryant et al., 2019) shared task. Here, we discuss grammatical and morphological differences between English and Greek and how these differences affected the development of ELERRANT. We also introduce the first Greek Native Corpus (GNC) and the Greek WikiEdits Corpus (GWE), two new evaluation datasets with errors from native Greek learners and Wikipedia Talk Pages edits respectively. These two datasets are used for the evaluation of ELERRANT. This paper is a sole fragment of a bigger picture which illustrates the attempt to solve the problem of low-resource languages in NLP, in our case Greek.
CITATION STYLE
Korre, K., Chatzipanagiotou, M., & Pavlopoulos, J. (2021). ELERRANT: Automatic Grammatical Error Type Classification for Greek. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 708–717). Incoma Ltd. https://doi.org/10.26615/978-954-452-072-4_081
Mendeley helps you to discover research relevant for your work.