Abstract
This paper studies three techniques that improve the quality of N-best hypotheses through additional regeneration process. Unlike the multi-system consensus approach where multiple translation systems are used, our improvement is achieved through the expansion of the N-best hypotheses from a single system. We explore three different methods to implement the regeneration process: re-decoding, n-gram expansion, and confusion network-based regeneration. Experiments on Chinese-to-English NIST and IWSLT tasks show that all three methods obtain consistent improvements. Moreover, the combination of the three strategies achieves further improvements and outperforms the baseline by 0.81 BLEU-score on IWSLT'06, 0.57 on NIST'03, 0.61 on NIST'05 test set respectively. © 2008 Licensed under the Creative Commons.
Cite
CITATION STYLE
Chen, B., Zhang, M., Aw, A., & Li, H. (2008). Regenerating hypotheses for statistical machine translation. In Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference (Vol. 1, pp. 105–112). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1599081.1599095
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.