Character-level translation has been shown to achieve competitive translation quality without explicit segmentation, but training a character-level model demands substantial hardware resources. In this paper, we introduced two character-level translation models for Japanese-English translation: a mid-gated model and a multi-attention model. We showed that the mid-gated model achieved the better performance in terms of BLEU scores. We also showed that a relatively narrow beam of width 4 or 5 was sufficient for the mid-gated model. As for unknown words, we showed that the mid-gated model could often translate those containing Katakana by coining a close word. We also showed that the model produced tolerable results for heavily noised sentences, even though it was trained on a noise-free dataset.
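To make "without explicit segmentation" concrete, the sketch below tokenizes Japanese and English text at the character level (no word segmenter such as MeCab is involved) and injects simple synthetic character noise of the kind a robustness test might use. It is a minimal illustration under our own assumptions; the special tokens and the noise scheme are hypothetical and not the authors' actual pipeline.

```python
# Minimal sketch of character-level preprocessing for Japanese-English MT.
# Illustrative only; BOS/EOS tokens and the noise procedure are assumptions,
# not the pipeline used in the paper.
import random

BOS, EOS = "<s>", "</s>"

def to_chars(sentence: str) -> list[str]:
    """Split a sentence into characters, so no word segmenter is needed."""
    return [BOS] + list(sentence) + [EOS]

def add_noise(chars: list[str], p: float = 0.1) -> list[str]:
    """Randomly drop or duplicate characters to simulate noisy input."""
    out = []
    for c in chars:
        r = random.random()
        if c in (BOS, EOS) or r >= p:
            out.append(c)          # keep the character unchanged
        elif r < p / 2:
            out.extend([c, c])     # duplicate it
        # otherwise drop it entirely
    return out

print(to_chars("コンピュータ"))            # Katakana loanword, one token per character
print(to_chars("machine translation")[:8])  # English is split the same way
print(add_noise(to_chars("コンピュータ"), p=0.3))
```

Because both languages are reduced to raw character sequences, the same preprocessing applies to Japanese and English alike, which is what lets a character-level model sidestep segmentation errors in the first place.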
Dai, J., & Yamaguchi, K. (2021). Compact and robust models for Japanese-English character-level machine translation. In WAT@EMNLP-IJCNLP 2019 - 6th Workshop on Asian Translation, Proceedings (pp. 36–44). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d19-5202