We investigate the differences between language models compiled from original target-language texts and those compiled from texts manually translated to the target language. Corroborating established observations of Translation Studies, we demonstrate that the latter are significantly better predictors of translated sentences than the former, and hence fit the reference set better. Furthermore, translated texts yield better language models for statistical machine translation than original texts. © 2012 Association for Computational Linguistics.
CITATION STYLE
Lembersky, G., Ordan, N., & Wintner, S. (2012). Language models for machine translation: Original vs. translated texts. Computational Linguistics, 38(4), 799–825. https://doi.org/10.1162/COLI_a_00111
Mendeley helps you to discover research relevant for your work.