Self-selection bias of similarity metrics in translation memory evaluation

Friedel Wolff; Laurette Pretorius; Loïc Dugast; Paul Buitelaar

Journal Article

Machine Translation (2016) 30(3-4) 129-144

DOI: 10.1007/s10590-016-9185-8

Abstract

A translation memory system attempts to retrieve useful suggestions from previous translations to assist a translator in a new translation task. While assisting the translator with a specific segment, the system usually employs some similarity metric to select the best matches from previously translated segments to present to the translator. Automated methods for evaluating a translation memory system usually use reference translations and some similarity metric. Such evaluation methods might be expected to assist in choosing between competing systems. No single evaluation method has gained widespread use, and the similarity metric used in each of these methods is not standardised either. This paper investigates the consequences of substituting the similarity metric in such an evaluation method, and finds that the similarity metrics exhibit a strong bias for the system using the same metric for retrieval. Consequently, the choice of similarity metric in the evaluation of translation memory systems should be carefully reconsidered.
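
The setup described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two metrics (`char_similarity`, `token_jaccard`), the helper names, and the toy memory are all assumptions chosen for illustration. Each metric is used once to retrieve a suggestion and once to score that suggestion against a reference translation, giving a retrieval-by-evaluation grid in which a self-selection effect could be looked for.

```python
from difflib import SequenceMatcher

# Hypothetical toy translation memory: (source segment, stored translation).
MEMORY = [
    ("Open the file", "Ouvrir le fichier"),
    ("Close the file", "Fermer le fichier"),
    ("Save the file as", "Enregistrer le fichier sous"),
]

# Hypothetical test set: (new source segment, reference translation).
QUERIES = [
    ("Open the files", "Ouvrir les fichiers"),
    ("Close file", "Fermer le fichier"),
]


def char_similarity(a: str, b: str) -> float:
    """Character-level similarity ratio in [0, 1], via difflib."""
    return SequenceMatcher(None, a, b).ratio()


def token_jaccard(a: str, b: str) -> float:
    """Jaccard overlap of lowercased word sets, in [0, 1]."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


def retrieve(query, memory, metric):
    """Return the memory entry whose source side is most similar to the query."""
    return max(memory, key=lambda pair: metric(query, pair[0]))


def cross_evaluate(queries, memory, metrics):
    """Mean evaluation score for every (retrieval metric, evaluation metric) pair.

    Assumption: evaluation compares the retrieved translation with the
    reference translation using the evaluation metric.
    """
    results = {}
    for r_name, r_metric in metrics.items():
        # Retrieve once per retrieval metric, then score with every metric.
        suggestions = [
            (retrieve(src, memory, r_metric)[1], ref) for src, ref in queries
        ]
        for e_name, e_metric in metrics.items():
            scores = [e_metric(sug, ref) for sug, ref in suggestions]
            results[(r_name, e_name)] = sum(scores) / len(scores)
    return results


METRICS = {"char": char_similarity, "jaccard": token_jaccard}

if __name__ == "__main__":
    for (r_name, e_name), score in cross_evaluate(QUERIES, MEMORY, METRICS).items():
        print(f"retrieval={r_name:8s} evaluation={e_name:8s} mean={score:.3f}")
```

On a toy memory this small the grid says nothing about bias; it only shows the shape of the experiment. With a realistic memory, one would compare each column of the grid to see whether an evaluation metric ranks the system that retrieves with that same metric above the others.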

Citation (APA)

Wolff, F., Pretorius, L., Dugast, L., & Buitelaar, P. (2016). Self-selection bias of similarity metrics in translation memory evaluation. Machine Translation, 30(3–4), 129–144. https://doi.org/10.1007/s10590-016-9185-8
