Abstract
We provide an in-depth analysis of the integration of an Arabic-to-English transliteration system into a general-purpose phrase-based statistical machine translation system. We study the integration from different aspects and evaluate the improvement that can be attributed to the integration using the BLEU metric. Our experiments show that a transliteration module can help significantly in the situation where the test data is rich with previously unseen named entities. We obtain 70% and 53% of the theoretical maximum improvement we could achieve, as measured by an oracle on development and test sets respectively for OOV words (out of vocabulary source words not appearing in the phrase table).
Cite
CITATION STYLE
Kashani, M. M., Joanis, E., Kuhn, R., Foster, G., & Popowich, F. (2007). Integration of an Arabic transliteration module into a statistical machine translation system. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 17–24). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1626355.1626358
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.