Unsupervised Word Segmentation Improves Dialectal Arabic to English Machine Translation

Abstract

We demonstrate the feasibility of using unsupervised morphological segmentation for dialects of Arabic, which are poor in linguistic resources. Our experiments with a Qatari Arabic to English machine translation system show that unsupervised segmentation improves translation quality compared with using no segmentation or with using ATB segmentation, which was designed specifically for Modern Standard Arabic (MSA). We use MSA and other dialects to improve Qatari Arabic to English machine translation, and we show that a uniform segmentation scheme across them yields an improvement of 1.5 BLEU points over using no segmentation.

Cite

Al-Mannai, K., Sajjad, H., Khader, A., Obaidli, F. A., Nakov, P., & Vogel, S. (2014). Unsupervised Word Segmentation Improves Dialectal Arabic to English Machine Translation. In ANLP 2014 - EMNLP 2014 Workshop on Arabic Natural Language Processing, Proceedings (pp. 207–216). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-3628
