Silver Data for Coreference Resolution in Ukrainian: Translation, Alignment, and Projection

Abstract

Low-resource languages continue to present challenges for current NLP methods, and multilingual NLP is gaining attention in the research community. One of the main issues is the lack of sufficient high-quality annotated data for low-resource languages. In this paper, we show how labeled data for high-resource languages such as English can be used in low-resource NLP. We present two silver datasets for coreference resolution in Ukrainian, adapted from existing English data by manual translation and machine translation in combination with automatic alignment and annotation projection. The code is made publicly available.
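To make the idea of annotation projection concrete, here is a minimal Python sketch of how coreference mention spans could be carried from English tokens to Ukrainian tokens through word alignments. It is an illustration only, not the paper's code: the function names `project_span` and `project_clusters`, the span representation, and the toy sentence with its alignment are all assumptions made for this example.

```python
from typing import List, Optional, Tuple

Span = Tuple[int, int]       # inclusive token indices (start, end)
Alignment = Tuple[int, int]  # (source token index, target token index)


def project_span(span: Span, alignment: List[Alignment]) -> Optional[Span]:
    """Map a source-side span to the smallest target span covering its aligned tokens."""
    start, end = span
    targets = [t for s, t in alignment if start <= s <= end]
    if not targets:
        return None  # no aligned token, so the mention cannot be projected
    return (min(targets), max(targets))


def project_clusters(
    clusters: List[List[Span]], alignment: List[Alignment]
) -> List[List[Span]]:
    """Project every mention in every cluster, dropping mentions with no alignment."""
    projected = []
    for cluster in clusters:
        spans = [p for p in (project_span(m, alignment) for m in cluster) if p is not None]
        if spans:
            projected.append(spans)
    return projected


# Hypothetical toy example (not taken from the datasets).
english = ["Anna", "said", "she", "left"]
ukrainian = ["Анна", "сказала", ",", "що", "вона", "пішла"]
alignment = [(0, 0), (1, 1), (2, 4), (3, 5)]  # assumed word-aligner output
clusters = [[(0, 0), (2, 2)]]                 # "Anna" and "she" corefer

projected = project_clusters(clusters, alignment)
print([[ukrainian[s:e + 1] for s, e in cluster] for cluster in projected])
```

Taking the minimum and maximum aligned indices keeps each projected mention contiguous, which is a common simplification; it can over-extend a span when the alignment is noisy, which is one reason data produced this way is called silver rather than gold.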

Cite

Kuchmiichuk, P. (2023). Silver Data for Coreference Resolution in Ukrainian: Translation, Alignment, and Projection. In EACL 2023 - 2nd Ukrainian Natural Language Processing Workshop, UNLP 2023 - Proceedings of the Workshop (pp. 62–72). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.unlp-1.8
