Regenerating hypotheses for statistical machine translation

Boxing Chen; Min Zhang; Aiti Aw; Haizhou Li

Conference Proceedings

Regenerating hypotheses for statistical machine translation

Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference (2008) 1 105-112

DOI: 10.3115/1599081.1599095

7Citations

91Readers

Get full text

Abstract

This paper studies three techniques that improve the quality of N-best hypotheses through additional regeneration process. Unlike the multi-system consensus approach where multiple translation systems are used, our improvement is achieved through the expansion of the N-best hypotheses from a single system. We explore three different methods to implement the regeneration process: re-decoding, n-gram expansion, and confusion network-based regeneration. Experiments on Chinese-to-English NIST and IWSLT tasks show that all three methods obtain consistent improvements. Moreover, the combination of the three strategies achieves further improvements and outperforms the baseline by 0.81 BLEU-score on IWSLT'06, 0.57 on NIST'03, 0.61 on NIST'05 test set respectively. © 2008 Licensed under the Creative Commons.

Cite

CITATION STYLE

APA

Chen, B., Zhang, M., Aw, A., & Li, H. (2008). Regenerating hypotheses for statistical machine translation. In Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference (Vol. 1, pp. 105–112). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1599081.1599095

Regenerating hypotheses for statistical machine translation

Abstract

Cite

Register to see more suggestions