Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation

33Citations
Citations of this article
89Readers
Mendeley users who have this article in their library.

Abstract

Parallel corpora are often not as parallel as one might assume: non-literal translations and noisy translations abound, even in curated corpora routinely used for training and evaluation. We use a cross-lingual textual entailment system to distinguish sentence pairs that are parallel in meaning from those that are not, and show that filtering out divergent examples from training improves translation quality.

Cite

CITATION STYLE

APA

Carpuat, M., Vyas, Y., & Niu, X. (2017). Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 69–79). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w17-3209

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free