Integrating an Unsupervised Transliteration Model into Statistical Machine Translation

73Citations
Citations of this article
91Readers
Mendeley users who have this article in their library.

Abstract

We investigate three methods for integrating an unsupervised transliteration model into an end-to-end SMT system. We induce a transliteration model from parallel data and use it to translate OOV words. Our approach is fully unsupervised and language independent. In the methods to integrate transliterations, we observed improvements from 0.23-0.75 (∆ 0.41) BLEU points across 7 language pairs. We also show that our mined transliteration corpora provide better rule coverage and translation quality compared to the gold standard transliteration corpora.

Cite

CITATION STYLE

APA

Durrani, N., Sajjad, H., Hoang, H., & Koehn, P. (2014). Integrating an Unsupervised Transliteration Model into Statistical Machine Translation. In EACL 2014 - 14th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 148–153). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/e14-4029

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free