Can Synthetic Translations Improve Bitext Quality?

6Citations
Citations of this article
42Readers
Mendeley users who have this article in their library.

Abstract

Synthetic translations have been used for a wide range of NLP tasks primarily as a means of data augmentation. This work explores, instead, how synthetic translations can be used to revise potentially imperfect reference translations in mined bitext. We find that synthetic samples can improve bitext quality without any additional bilingual supervision when they replace the originals based on a semantic equivalence classifier that helps mitigate NMT noise. The improved quality of the revised bitext is confirmed intrinsically via human evaluation and extrinsically through bilingual induction and MT tasks.

Cite

CITATION STYLE

APA

Briakou, E., & Carpuat, M. (2022). Can Synthetic Translations Improve Bitext Quality? In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 4753–4766). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.acl-long.326

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free