Abstract
Current phrase-based statistical machine translation systems process each test sentence in isolation and do not enforce global consistency constraints, even though the test data is often internally consistent with respect to topic or style. We propose a new consistency model for machine translation in the form of a graph-based semi-supervised learning algorithm that exploits similarities between training and test data and also similarities between different test sentences. The algorithm learns a regression function jointly over training and test data and uses the resulting scores to rerank translation hypotheses. Evaluation on two travel expression translation tasks demonstrates improvements of up to 2.6 BLEU points absolute and 2.8% in PER. © 2009 Association for Computational Linguistics.
Cite
CITATION STYLE
Alexandrescu, A., & Kirchhoff, K. (2009). Graph-based learning for statistical machine translation. In NAACL HLT 2009 - Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Conference (pp. 119–127). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1620754.1620772
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.