Analysis of statistical and morphological classes to generate weighted reordering hypotheses on a Statistical Machine Translation system

Marta R. Costa-Jussà; José A.R. Fonollosa

Conference Proceedings

Analysis of statistical and morphological classes to generate weighted reordering hypotheses on a Statistical Machine Translation system

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2007) 171-176

DOI: 10.3115/1626355.1626376

7Citations

88Readers

Get full text

Abstract

One main challenge of statistical machine translation (SMT) is dealing with word order. The main idea of the statistical machine reordering (SMR) approach is to use the powerful techniques of SMT systems to generate a weighted reordering graph for SMT systems. This technique supplies reordering constraints to an SMT system, using statistical criteria. In this paper, we experiment with different graph pruning which guarantees the translation quality improvement due to reordering at a very low increase of computational cost. The SMR approach is capable of generalizing reorderings, which have been learned during training, by using word classes instead of words themselves. We experiment with statistical and morphological classes in order to choose those which capture the most probable reorderings. Satisfactory results are reported in the WMT07 Es/En task. Our system outperforms in terms of BLEU the WMT07 Official baseline system.

Cite

CITATION STYLE

APA

Costa-Jussà, M. R., & Fonollosa, J. A. R. (2007). Analysis of statistical and morphological classes to generate weighted reordering hypotheses on a Statistical Machine Translation system. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 171–176). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1626355.1626376

Analysis of statistical and morphological classes to generate weighted reordering hypotheses on a Statistical Machine Translation system

Abstract

Cite

Register to see more suggestions