In this paper, we argue that n-gram language models are not sufficient to address word reordering required for Machine Translation. We propose a new distortion model that can be used with existing phrase-based SMT decoders to address those n-gram language model limitations. We present empirical results in Arabic to English Machine Translation that show statistically significant improvements when our proposed model is used. We also propose a novel metric to measure word order similarity (or difference) between any pair of languages based on word alignments. © 2006 Association for Computational Linguistics.
CITATION STYLE
Al-Onaizan, Y., & Papineni, K. (2006). Distortion models for statistical machine translation. In COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Vol. 1, pp. 529–536). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1220175.1220242
Mendeley helps you to discover research relevant for your work.