Neural machine translation via binary code prediction

Yusuke Oda; Philip Arthur; Graham Neubig; Koichiro Yoshino; Satoshi Nakamura

Conference ProceedingsOPEN ACCESS

Neural machine translation via binary code prediction

ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers) (2017) 1 850-860

DOI: 10.18653/v1/P17-1079

6Citations

168Readers

Abstract

In this paper, we propose a new method for calculating the output layer in neural machine translation systems. The method is based on predicting a binary code for each word and can reduce computation time/memory requirements of the output layer to be logarithmic in vocabulary size in the best case. In addition, we also introduce two advanced approaches to improve the robustness of the proposed model: using error-correcting codes and combining softmax and binary codes. Experiments on two English ? Japanese bidirectional translation tasks show proposed models achieve BLEU scores that approach the softmax, while reducing memory usage to the order of less than 1/10 and improving decoding speed on CPUs by x5 to x10.

Cite

CITATION STYLE

APA

Oda, Y., Arthur, P., Neubig, G., Yoshino, K., & Nakamura, S. (2017). Neural machine translation via binary code prediction. In ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers) (Vol. 1, pp. 850–860). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/P17-1079

Neural machine translation via binary code prediction

Abstract

Cite

Register to see more suggestions