Dynamic data selection for neural machine translation

Marlies van der Wees; Arianna Bisazza; Christof Monz

Conference ProceedingsOPEN ACCESS

Dynamic data selection for neural machine translation

EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings (2017) 1400-1410

DOI: 10.18653/v1/d17-1147

98Citations

209Readers

Abstract

Intelligent selection of training data has proven a successful technique to simultaneously increase training efficiency and translation performance for phrase-based machine translation (PBMT). With the recent increase in popularity of neural machine translation (NMT), we explore in this paper to what extent and how NMT can also benefit from data selection. While state-of-the-art data selection (Axelrod et al., 2011) consistently performs well for PBMT, we show that gains are substantially lower for NMT. Next, we introduce dynamic data selection for NMT, a method in which we vary the selected subset of training data between different training epochs. Our experiments show that the best results are achieved when applying a technique we call gradual fine-tuning, with improvements up to +2.6 BLEU over the original data selection approach and up to +3.1 BLEU over a general baseline.

Cite

CITATION STYLE

APA

van der Wees, M., Bisazza, A., & Monz, C. (2017). Dynamic data selection for neural machine translation. In EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 1400–1410). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d17-1147

Dynamic data selection for neural machine translation

Abstract

Cite

Register to see more suggestions