MEANT 2.0: Accurate semantic MT evaluation for any output language

Abstract

We describe a new version of MEANT, which participated in the metrics task of the Second Conference on Machine Translation (WMT 2017). MEANT 2.0 uses idf-weighted distributional n-gram accuracy to determine the phrasal similarity of semantic role fillers and yields better correlations with human judgments of translation quality than earlier versions. The improved phrasal similarity enables a variant of MEANT to accurately evaluate translation adequacy for any output language, even languages without an automatic semantic parser. Our results show that MEANT, an untrained, non-ensemble metric, consistently performs as well as the top participants of previous years, including trained and ensemble metrics, across different output languages. We also report timing statistics for MEANT to allow better estimation of its evaluation cost. MEANT 2.0 is open source and publicly available.
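To make the phrasal-similarity idea concrete, below is a minimal sketch (not the authors' implementation) of idf-weighted n-gram matching between a candidate role filler and a reference role filler. The exact-match token similarity, the n-gram order cap, and the aggregation over n-gram orders are simplifying assumptions for illustration; MEANT 2.0 scores tokens by the similarity of their distributional word vectors rather than by exact matching.

```python
import math


def idf(token, reference_docs):
    """Smoothed inverse document frequency of a token over a reference collection."""
    df = sum(1 for doc in reference_docs if token in doc)
    return math.log((len(reference_docs) + 1) / (df + 1)) + 1.0


def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def token_sim(a, b):
    """Stand-in lexical similarity: exact string match.
    MEANT 2.0 instead uses distributional word-vector similarity,
    so near-synonyms also receive partial credit."""
    return 1.0 if a == b else 0.0


def phrase_similarity(candidate, reference, reference_docs, max_n=2):
    """Toy idf-weighted n-gram similarity between two role-filler phrases."""
    per_order_scores = []
    for n in range(1, max_n + 1):
        cand_ngrams = ngrams(candidate, n)
        ref_ngrams = ngrams(reference, n)
        if not cand_ngrams or not ref_ngrams:
            continue
        matched, total_weight = 0.0, 0.0
        for cg in cand_ngrams:
            # Weight each candidate n-gram by the idf of its tokens, so
            # content words count for more than function words.
            weight = sum(idf(t, reference_docs) for t in cg)
            # Score the n-gram against its best-matching reference n-gram.
            best = max(
                sum(token_sim(a, b) for a, b in zip(cg, rg)) / n
                for rg in ref_ngrams
            )
            matched += weight * best
            total_weight += weight
        per_order_scores.append(matched / total_weight if total_weight else 0.0)
    return (sum(per_order_scores) / len(per_order_scores)
            if per_order_scores else 0.0)


if __name__ == "__main__":
    docs = [["the", "cat", "sat", "down"],
            ["a", "dog", "barked"],
            ["the", "dog", "sat", "down"]]
    # Compare a candidate role filler against its reference counterpart.
    print(phrase_similarity(["the", "cat", "sat"],
                            ["the", "cat", "sat", "down"], docs))
```

In the full metric, such phrase-level scores for the aligned semantic role fillers are further aggregated into a sentence-level adequacy score; the sketch above only illustrates the idf-weighted n-gram component named in the abstract.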

Cite (APA)

Lo, C.-K. (2017). MEANT 2.0: Accurate semantic MT evaluation for any output language. In Proceedings of the Second Conference on Machine Translation (WMT 2017) (pp. 589–597). Association for Computational Linguistics. https://doi.org/10.18653/v1/w17-4767
