Complexity of finding the BLEU-optimal hypothesis in a confusion network

Gregor Leusch; Evgeny Matusov; Hermann Ney

Conference ProceedingsOPEN ACCESS

Complexity of finding the BLEU-optimal hypothesis in a confusion network

EMNLP 2008 - 2008 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference: A Meeting of SIGDAT, a Special Interest Group of the ACL (2008) 839-847

DOI: 10.3115/1613715.1613823

10Citations

88Readers

Abstract

Confusion networks are a simple representation of multiple speech recognition or translation hypotheses in a machine translation system. A typical operation on a confusion network is to find the path which minimizes or maximizes a certain evaluation metric. In this article, we show that this problem is generally NP-hard for the popular BLEU metric, as well as for smaller variants of BLEU. This also holds for more complex representations like generic word graphs. In addition, we give an efficient polynomial-time algorithm to calculate unigram BLEU on confusion networks, but show that even small generalizations of this data structure render the problem to be NP-hard again. Since finding the optimal solution is thus not always feasible, we introduce an approximating algorithm based on a multi-stack decoder, which finds a (not necessarily optimal) solution for n-gram BLEU in polynomial time. © 2008 Association for Computational Linguistics.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Leusch, G., Matusov, E., & Ney, H. (2008). Complexity of finding the BLEU-optimal hypothesis in a confusion network. In EMNLP 2008 - 2008 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference: A Meeting of SIGDAT, a Special Interest Group of the ACL (pp. 839–847). https://doi.org/10.3115/1613715.1613823

Readers' Seniority

PhD / Post grad / Masters / Doc 28

62%

Researcher 10

22%

Professor / Associate Prof. 4

Lecturer / Post doc 3

Readers' Discipline

Computer Science 38

78%

Linguistics 6

12%

Social Sciences 3

Business, Management and Accounting 2

Complexity of finding the BLEU-optimal hypothesis in a confusion network

Abstract

References Powered by Scopus

The complexity of phrase alignment problems

Speech translation by confusion network decoding

Comparing reordering constraints for SMT using efficient BLEU oracle computation

Cited by Powered by Scopus

Online large-margin training of syntactic and structural translation features

Decomposability of translation metrics for improved evaluation and efficient algorithms

Optimization for statistical machine translation: A survey

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline