How do humans evaluate machine translation

Citations: 9
Readers (Mendeley): 89

Abstract

In this paper, we take a closer look at the MT evaluation process from a glass-box perspective using eye-tracking. We analyze two aspects of the evaluation task: the background of the evaluators (monolingual or bilingual) and the sources of information available to them, and we assess both using time and consistency as criteria. Our findings show that monolinguals are slower but more consistent than bilinguals, especially when only target-language information is available. When exposed to multiple sources of information, evaluators in general take more time, and in the case of monolinguals there is a drop in consistency. Our findings suggest that, for consistent and cost-effective MT evaluation, it is better to use monolinguals with only target-language information.
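The abstract names time and consistency as the evaluation criteria but does not spell out how consistency is computed. A common way to quantify an evaluator's consistency is intra-annotator agreement, for example Cohen's kappa over repeated judgments of the same items. The sketch below is only illustrative: the choice of kappa, the 1-5 adequacy scores, and the example data are assumptions, not the paper's actual metric or results.

from collections import Counter

def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa between two lists of categorical judgments.

    When both lists come from the same evaluator judging the same MT
    outputs twice, kappa measures intra-annotator consistency.
    """
    assert len(ratings_a) == len(ratings_b)
    n = len(ratings_a)
    # Observed agreement: fraction of items given the same label both times.
    p_o = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement, estimated from the marginal label frequencies.
    freq_a, freq_b = Counter(ratings_a), Counter(ratings_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    return (p_o - p_e) / (1 - p_e)

# Hypothetical adequacy judgments (1-5) for ten sentences, rated twice.
first_pass  = [4, 3, 5, 2, 4, 4, 1, 3, 5, 2]
second_pass = [4, 3, 4, 2, 4, 3, 1, 3, 5, 2]
print(f"intra-annotator kappa: {cohens_kappa(first_pass, second_pass):.2f}")

Comparing such a score across evaluator groups (monolingual vs. bilingual) and information conditions, alongside per-item timing, is one plausible way to operationalize the time/consistency trade-off the abstract describes.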

Citation (APA)

Guzmán, F., Abdelali, A., Temnikova, I., Sajjad, H., & Vogel, S. (2015). How do humans evaluate machine translation. In Proceedings of the Tenth Workshop on Statistical Machine Translation (WMT 2015) at EMNLP 2015 (pp. 457–466). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w15-3059
