TMop: A tool for unsupervised Translation Memory cleaning

Abstract

We present TMop, the first open-source tool for automatic Translation Memory (TM) cleaning. The tool implements a fully unsupervised approach to the task, which makes it possible to spot unreliable translation units (TUs), i.e., sentence pairs in different languages that are supposed to be translations of each other, without requiring labeled training data. TMop includes a highly configurable and extensible set of filters capturing different aspects of translation quality. It has been evaluated on a test set of 1,000 TUs randomly extracted from the English-Italian version of MyMemory, a large-scale public TM. Results indicate its effectiveness in automatically removing "bad" TUs, with performance comparable to a state-of-the-art supervised method (76.3 vs. 77.7 balanced accuracy).
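
For readers unfamiliar with the metric reported above, balanced accuracy for a binary task is the mean of the recall on each class, which keeps the score meaningful when "good" and "bad" TUs are unevenly represented. The following is a minimal sketch of that standard definition; it is not taken from TMop, and the function name and the toy labels are hypothetical.

```python
def balanced_accuracy(gold, predicted):
    """Mean of per-class recall for binary labels (True = "bad" TU).

    Hypothetical helper illustrating the standard definition; it is
    not part of TMop.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    # Correctly flagged "bad" TUs and correctly kept "good" TUs.
    true_pos = sum(1 for g, p in zip(gold, predicted) if g and p)
    true_neg = sum(1 for g, p in zip(gold, predicted) if not g and not p)

    n_pos = sum(1 for g in gold if g)
    n_neg = len(gold) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both classes must appear in the gold labels")

    recall_bad = true_pos / n_pos    # sensitivity
    recall_good = true_neg / n_neg   # specificity
    return 0.5 * (recall_bad + recall_good)


# Toy data (made up for illustration): 2 "bad" and 4 "good" TUs.
gold = [True, True, False, False, False, False]
predicted = [True, False, False, False, False, True]
score = balanced_accuracy(gold, predicted)
```

With these toy labels, one of the two "bad" TUs and three of the four "good" TUs are classified correctly, so the score is the average of 0.5 and 0.75.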

Cite

Sabet, M. J., Negri, M., Turchi, M., De Souza, J. G. C., & Federico, M. (2016). TMop: A tool for unsupervised Translation Memory cleaning. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - System Demonstrations (pp. 49–54). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p16-4009
