Reducing Length Bias in Scoring Neural Machine Translation via a Causal Inference Method

0Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Neural machine translation (NMT) usually employs beam search to expand the searching space and obtain more translation candidates. However, the increase of the beam size often suffers from plenty of short translations, resulting in dramatical decrease in translation quality. In this paper, we handle the length bias problem through a perspective of causal inference. Specifically, we regard the model generated translation score S as a degraded true translation quality affected by some noise, and one of the confounders is the translation length. We apply a Half-Sibling Regression method to remove the length effect on S, and then we can obtain a debiased translation score without length information. The proposed method is model agnostic and unsupervised, which is adaptive to any NMT model and test dataset. We conduct the experiments on three translation tasks with different scales of datasets. Experimental results and further analyses show that our approaches gain comparable performance with the empirical baseline methods.

Cite

CITATION STYLE

APA

Shi, X., Huang, H., Jian, P., & Tang, Y. K. (2021). Reducing Length Bias in Scoring Neural Machine Translation via a Causal Inference Method. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12869 LNAI, pp. 3–15). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-84186-7_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free