Silver Data for Coreference Resolution in Ukrainian: Translation, Alignment, and Projection

Pavlo Kuchmiichuk

Conference Proceedings

Silver Data for Coreference Resolution in Ukrainian: Translation, Alignment, and Projection

Kuchmiichuk P

EACL 2023 - 2nd Ukrainian Natural Language Processing Workshop, UNLP 2023 - Proceedings of the Workshop (2023) 62-72

DOI: 10.18653/v1/2023.unlp-1.8

0Citations

18Readers

Get full text

Abstract

Low-resource languages continue to present challenges for current NLP methods, and multilingual NLP is gaining attention in the research community. One of the main issues is the lack of sufficient high-quality annotated data for low-resource languages. In this paper, we show how labeled data for high-resource languages such as English can be used in low-resource NLP. We present two silver datasets for coreference resolution in Ukrainian, adapted from existing English data by manual translation and machine translation in combination with automatic alignment and annotation projection. The code is made publicly available1

Cite

CITATION STYLE

APA

Kuchmiichuk, P. (2023). Silver Data for Coreference Resolution in Ukrainian: Translation, Alignment, and Projection. In EACL 2023 - 2nd Ukrainian Natural Language Processing Workshop, UNLP 2023 - Proceedings of the Workshop (pp. 62–72). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.unlp-1.8

Silver Data for Coreference Resolution in Ukrainian: Translation, Alignment, and Projection

Abstract

Cite

Register to see more suggestions