Hybrid translation with classification: Revisiting rule-based and neural machine translation

21Citations
Citations of this article
43Readers
Mendeley users who have this article in their library.

Abstract

This paper proposes a hybrid machine-translation system that combines neural machine translation with well-developed rule-based machine translation to utilize the stability of the latter to compensate for the inadequacy of neural machine translation in rare-resource domains. A classifier is introduced to predict which translation from the two systems is more reliable. We explore a set of features that reflect the reliability of translation and its process, and training data is automatically expanded with a small, human-labeled dataset to solve the insufficient-data problem. A series of experiments shows that the hybrid system’s translation accuracy is improved, especially in out-of-domain translations, and classification accuracy is greatly improved when using the proposed features and the automatically constructed training set. A comparison between feature-and text-based classification is also performed, and the results show that the feature-based model achieves better classification accuracy, even when compared to neural network text classifiers.

Cite

CITATION STYLE

APA

Huang, J. X., Lee, K. S., & Kim, Y. K. (2020). Hybrid translation with classification: Revisiting rule-based and neural machine translation. Electronics (Switzerland), 9(2). https://doi.org/10.3390/electronics9020201

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free