Towards the First Named Entity Recognition of Inuktitut for an Improved Machine Translation

1Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Named Entity Recognition is a crucial step to ensure good quality performance of several Natural Language Processing applications and tools, including machine translation and information retrieval. Moreover, it is considered as a fundamental module of many Natural Language Understanding tasks such as question-answering systems. This paper presents a first study on NER for an under-represented Indigenous Inuit language of Canada, Inuktitut, which lacks linguistic resources and large labeled data. Our proposed NER model for Inuktitut is built by transferring linguistic characteristics from English to Inuktitut, based on either rules or bilingual word embeddings. We provide an empirical study based on a comparison with the state of the art models and as well as intrinsic and extrinsic evaluations. In terms of Recall, Precision and F-score, the obtained results show the effectiveness of the proposed NER methods. Furthermore, it improved the performance of Inuktitut-English Neural Machine Translation.

Cite

CITATION STYLE

APA

Le, N. T., Kasdi, I., & Sadat, F. (2023). Towards the First Named Entity Recognition of Inuktitut for an Improved Machine Translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 84–93). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.americasnlp-1.10

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free