Combining morphosyntactic enriched representation with n-best reranking in statistical translation

H. Bonneau-Maynard; A. Allauzen; D. Déchelotte; H. Schwenk

Conference Proceedings

Proceedings of NAACL-HLT 2007/AMTA Workshop on Syntax and Structure in Statistical Translation, SSST 2007 (2007) 65-71

DOI: 10.3115/1626281.1626290

Abstract

The purpose of this work is to explore the integration of morphosyntactic information into the translation model itself, by enriching words with their morphosyntactic categories. We investigate word disambiguation using morphosyntactic categories, n-best hypotheses reranking, and the combination of both methods with word or morphosyntactic n-gram language model reranking. Experiments are carried out on the English-to-Spanish translation task. Using the morphosyntactic language model alone does not result in any improvement in performance. However, combining morphosyntactic word disambiguation with a word-based 4-gram language model results in a relative improvement in the BLEU score of 2.3% on the development set and 1.9% on the test set.
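
The combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' system: the `word|TAG` token format, the function and class names, the interpolation weight, and the add-one smoothed bigram model (standing in for the 4-gram model mentioned above) are all assumptions made for illustration. Each hypothesis in an n-best list carries enriched tokens and a decoder score; the tags are stripped, the plain words are scored with a word-based language model, and the list is re-sorted by the combined score.

```python
import math
from collections import Counter


def strip_tags(enriched):
    """Map enriched tokens such as 'casa|NOUN' back to plain words.

    The 'word|TAG' format is an assumption made for this sketch.
    """
    return [tok.rsplit("|", 1)[0] for tok in enriched]


class BigramLM:
    """Add-one smoothed bigram model, a toy stand-in for a 4-gram LM."""

    def __init__(self, sentences):
        self.contexts = Counter()  # how often each word appears as a left context
        self.bigrams = Counter()   # how often each adjacent word pair appears
        vocab = {"<s>", "</s>"}
        for sent in sentences:
            toks = ["<s>"] + sent + ["</s>"]
            vocab.update(sent)
            self.contexts.update(toks[:-1])
            self.bigrams.update(zip(toks, toks[1:]))
        self.vocab_size = len(vocab)

    def logprob(self, sent):
        """Natural-log probability of a sentence given as a list of words."""
        toks = ["<s>"] + sent + ["</s>"]
        total = 0.0
        for prev, cur in zip(toks, toks[1:]):
            num = self.bigrams[(prev, cur)] + 1
            den = self.contexts[prev] + self.vocab_size
            total += math.log(num / den)
        return total


def rerank(nbest, lm, lm_weight=1.0):
    """Sort (enriched_tokens, decoder_score) pairs best-first.

    The combined score is decoder_score + lm_weight * LM log-probability
    of the plain words; lm_weight is a hypothetical tuning parameter.
    """
    def combined(item):
        tokens, decoder_score = item
        return decoder_score + lm_weight * lm.logprob(strip_tags(tokens))

    return sorted(nbest, key=combined, reverse=True)


# Toy data, invented for illustration only.
lm = BigramLM([["la", "casa", "blanca"], ["la", "casa", "grande"]])
nbest = [
    (["la|DET", "blanca|ADJ", "casa|NOUN"], -1.0),  # preferred by the decoder
    (["la|DET", "casa|NOUN", "blanca|ADJ"], -1.2),  # preferred by the LM
]
best_tokens, best_score = rerank(nbest, lm)[0]
print(" ".join(strip_tags(best_tokens)))
```

In this toy setting the language model outweighs the small decoder-score difference, so the second hypothesis moves to the top. A real system would tune the weight on a development set and use a language model trained on a large corpus.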

Cite

Bonneau-Maynard, H., Allauzen, A., Déchelotte, D., & Schwenk, H. (2007). Combining morphosyntactic enriched representation with n-best reranking in statistical translation. In Proceedings of NAACL-HLT 2007/AMTA Workshop on Syntax and Structure in Statistical Translation, SSST 2007 (pp. 65–71). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1626281.1626290
