Non-parametric Online Learning from Human Feedback for Neural Machine Translation


Abstract

We study the problem of online learning with human feedback in human-in-the-loop machine translation, in which human translators revise machine-generated translations and the corrected translations are then used to improve the neural machine translation (NMT) system. However, previous methods require online model updating or additional translation memory networks to achieve high-quality performance, making them inflexible and inefficient in practice. In this paper, we propose a novel non-parametric online learning method that does not change the model structure. The approach introduces two k-nearest-neighbor (KNN) modules: one memorizes the human feedback, that is, the corrected sentences provided by human translators, while the other adaptively balances the use of historical human feedback and the original NMT model. Experiments on the EMEA and JRC-Acquis benchmarks demonstrate that the proposed method obtains substantial improvements in translation accuracy and achieves better adaptation performance with fewer repeated human correction operations.
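
To make the two-module idea concrete, here is a minimal sketch of a nearest-neighbor memory whose predictions are mixed with a base model's token distribution. It is an illustration under stated assumptions, not the authors' implementation: the names `FeedbackDatastore`, `knn_distribution`, `distance_based_lambda`, and `interpolate` are hypothetical, the context vectors are toy lists of floats, and the distance-based mixing weight is a simple stand-in for the paper's second KNN module, whose details are not given in the abstract.

```python
import math
from collections import defaultdict


class FeedbackDatastore:
    """Toy memory of (context vector, target token) pairs taken from
    human-corrected translations. Hypothetical structure for illustration."""

    def __init__(self):
        self.keys = []    # context vectors (lists of floats)
        self.values = []  # target tokens seen in corrected translations

    def add(self, key, token):
        self.keys.append(list(key))
        self.values.append(token)

    def search(self, query, k):
        """Return up to k (squared L2 distance, token) pairs, nearest first."""
        scored = [
            (sum((q - x) ** 2 for q, x in zip(query, key)), token)
            for key, token in zip(self.keys, self.values)
        ]
        scored.sort(key=lambda pair: pair[0])
        return scored[:k]


def knn_distribution(neighbors, temperature=1.0):
    """Softmax over negative distances, summed per token."""
    if not neighbors:
        return {}
    nearest = neighbors[0][0]  # subtract the minimum for numerical stability
    weights = [math.exp(-(d - nearest) / temperature) for d, _ in neighbors]
    total = sum(weights)
    probs = defaultdict(float)
    for weight, (_, token) in zip(weights, neighbors):
        probs[token] += weight / total
    return dict(probs)


def distance_based_lambda(neighbors, scale=1.0):
    """Assumed heuristic (not from the paper): trust the memory more
    when the nearest stored context is close to the query."""
    if not neighbors:
        return 0.0
    return math.exp(-neighbors[0][0] / scale)


def interpolate(model_probs, knn_probs, lam):
    """Mix the two distributions: lam * p_knn + (1 - lam) * p_model."""
    vocab = set(model_probs) | set(knn_probs)
    return {
        token: lam * knn_probs.get(token, 0.0)
        + (1.0 - lam) * model_probs.get(token, 0.0)
        for token in vocab
    }
```

In this sketch, each corrected sentence would contribute one `add` call per target token, so the memory grows without any change to the model's parameters. When no stored context is near the query, the weight falls toward zero and the output reverts to the base model's distribution.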

Cite

Wang, D., Wei, H., Zhang, Z., Huang, S., Xie, J., & Chen, J. (2022). Non-parametric Online Learning from Human Feedback for Neural Machine Translation. In Proceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022 (Vol. 36, pp. 11431–11439). Association for the Advancement of Artificial Intelligence. https://doi.org/10.1609/aaai.v36i10.21395
