Abstract
We extend a current sequence-tagging approach to Grammatical Error Correction (GEC) by introducing specialised tags for spelling correction and morphological inflection using the SymSpell and LemmInflect algorithms. Our approach improves generalisation: the proposed new tagset allows a smaller number of tags to correct a larger range of errors. Our results show a performance improvement both overall and in the targeted error categories. We further show that ensembles trained with our new tagset outperform those trained with the baseline tagset on the public BEA benchmark.
Cite
CITATION STYLE
Mesham, S., Bryant, C., Rei, M., & Yuan, Z. (2023). An Extended Sequence Tagging Vocabulary for Grammatical Error Correction. In EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2023 (pp. 1563–1574). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-eacl.119
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.