Self-selection bias of similarity metrics in translation memory evaluation

1Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A translation memory system attempts to retrieve useful suggestions from previous translations to assist a translator in a new translation task. While assisting the translator with a specific segment, some similarity metric is usually employed to select the best matches from previously translated segments to present to a translator. Automated methods for evaluating a translation memory system usually use reference translations and some similarity metric. Such evaluation methods might be expected to assist in choosing between competing systems. No single evaluation method has gained widespread use; additionally the similarity metric used in each of these methods is not standardised either. This paper investigates the consequences of substituting the similarity metric in such an evaluation method, and finds that the similarity metrics exhibit a strong bias for the system using the same metric for retrieval. Consequently the choice of similarity metric in the evaluation of translation memory systems should be carefully reconsidered.

Cite

CITATION STYLE

APA

Wolff, F., Pretorius, L., Dugast, L., & Buitelaar, P. (2016). Self-selection bias of similarity metrics in translation memory evaluation. Machine Translation, 30(3–4), 129–144. https://doi.org/10.1007/s10590-016-9185-8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free