Morpho-syntactic arabic preprocessing for Arabic-to-english statistical machine translation

Anas El Isbihani; Shahram Khadivi; Oliver Bender; Hermann Ney

Conference Proceedings

Morpho-syntactic arabic preprocessing for Arabic-to-english statistical machine translation

HLT-NAACL 2006 - Statistical Machine Translation, Proceedings of the Workshop (2006) 15-22

DOI: 10.3115/1654650.1654654

26Citations

108Readers

Get full text

Abstract

The Arabic language has far richer systems of inflection and derivation than English which has very little morphology. This morphology difference causes a large gap between the vocabulary sizes in any given parallel training corpus. Segmentation of inflected Arabic words is a way to smooth its highly morphological nature. In this paper, we describe some statistically and linguistically motivated methods for Arabic word segmentation. Then, we show the efficiency of proposed methods on the Arabic-English BTEC and NIST tasks.

Cite

CITATION STYLE

APA

El Isbihani, A., Khadivi, S., Bender, O., & Ney, H. (2006). Morpho-syntactic arabic preprocessing for Arabic-to-english statistical machine translation. In HLT-NAACL 2006 - Statistical Machine Translation, Proceedings of the Workshop (pp. 15–22). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1654650.1654654

Morpho-syntactic arabic preprocessing for Arabic-to-english statistical machine translation

Abstract

Cite

Register to see more suggestions