State-of-the-art translation Quality Estimation (QE) models are proven to be biased. More specifically, they over-rely on monolingual features while ignoring the bilingual semantic alignment. In this work, we propose a novel method to mitigate the bias of the QE model and improve estimation performance. Our method is based on the contrastive learning between clean and noisy sentence pairs. We first introduce noise to the target side of the parallel sentence pair, forming the negative samples. With the original parallel pairs as the positive sample, the QE model is contrastively trained to distinguish the positive samples from the negative ones. This objective is jointly trained with the regression-style quality estimation, so as to prevent the QE model from overfitting to monolingual features. Experiments on WMT QE evaluation datasets demonstrate that our method improves the estimation performance by a large margin while mitigating the bias.
CITATION STYLE
Huang, H., Wu, S., Chen, K., Di, H., Yang, M., & Zhao, T. (2023). Improving Translation Quality Estimation with Bias Mitigation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 1, pp. 2175–2190). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.acl-long.121
Mendeley helps you to discover research relevant for your work.