We examine the effects of particular orderings of sentence pairs on the on-line training of neural machine translation (NMT). We focus on two types of such orderings: (1) ensuring that each minibatch contains sentences similar in some aspect and (2) gradual inclusion of some sentence types as the training progresses (so called "curriculum learning"). In our English-to-Czech experiments, the internal homogeneity of minibatches has no effect on the training but some of our "curricula" achieve a small improvement over the baseline.
CITATION STYLE
Kocmi, T., & Bojar, O. (2017). Curriculum learning and minibatch bucketing in neural machine translation. In International Conference Recent Advances in Natural Language Processing, RANLP (Vol. 2017-September, pp. 379–386). Incoma Ltd. https://doi.org/10.26615/978-954-452-049-6_050
Mendeley helps you to discover research relevant for your work.