Abstract
This paper describes an efficient method to extract large n-best lists from a word graph produced by a statistical machine translation system. The extraction is based on the k shortest paths algorithm which is efficient even for very large k. We show that, although we can generate large amounts of distinct translation hypotheses, these numerous candidates are not able to significantly improve overall system performance. We conclude that large n-best lists would benefit from better discriminating models.
Cite
CITATION STYLE
Hasan, S., Zens, R., & Ney, H. (2007). Are very large N-best lists useful for SMT? In NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers (pp. 57–60). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1614108.1614123
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.