Abstract
We demonstrate the feasibility of using unsupervised morphological segmentation for dialects of Arabic, which are poor in linguistics resources. Our experiments using a Qatari Arabic to English machine translation system show that unsupervised segmentation helps to improve the translation quality as compared to using no segmentation or to using ATB segmentation, which was especially designed for Modern Standard Arabic (MSA). We use MSA and other dialects to improve Qatari Arabic to English machine translation, and we show that a uniform segmentation scheme across them yields an improvement of 1.5 BLEU points over using no segmentation.
Cite
CITATION STYLE
Al-Mannai, K., Sajjad, H., Khader, A., Obaidli, F. A., Nakov, P., & Vogel, S. (2014). Unsupervised Word Segmentation Improves Dialectal Arabic to English Machine Translation. In ANLP 2014 - EMNLP 2014 Workshop on Arabic Natural Language Processing, Proceedings (pp. 207–216). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/w14-3628
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.