Coarse-To-Fine Learning for Neural Machine Translation

Zhirui Zhang; Shujie Liu; Mu Li; Ming Zhou; Enhong Chen

Conference Proceedings

Coarse-To-Fine Learning for Neural Machine Translation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11108 LNAI 316-328

DOI: 10.1007/978-3-319-99495-6_27

2Citations

8Readers

Get full text

Abstract

In this paper, we address the problem of learning better word representations for neural machine translation (NMT). We propose a novel approach to NMT model training based on coarse-to-fine learning paradigm, which is able to infer better NMT model parameters for a wide range of less-frequent words in the vocabulary. To this end, our proposed method first groups source and target words into a set of hierarchical clusters, then a sequence of NMT models are learned based on it with growing cluster granularity. Each subsequent model inherits model parameters from its previous one and refines them with finer-grained word-cluster mapping. Experimental results on public data sets demonstrate that our proposed method significantly outperforms baseline attention-based NMT model on Chinese-English and English-French translation tasks.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhang, Z., Liu, S., Li, M., Zhou, M., & Chen, E. (2018). Coarse-To-Fine Learning for Neural Machine Translation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11108 LNAI, pp. 316–328). Springer Verlag. https://doi.org/10.1007/978-3-319-99495-6_27

Coarse-To-Fine Learning for Neural Machine Translation

Abstract

Author supplied keywords

Cite

Register to see more suggestions