Abstract
This paper proposes a hybrid machine-translation system that combines neural machine translation with well-developed rule-based machine translation to utilize the stability of the latter to compensate for the inadequacy of neural machine translation in rare-resource domains. A classifier is introduced to predict which translation from the two systems is more reliable. We explore a set of features that reflect the reliability of translation and its process, and training data is automatically expanded with a small, human-labeled dataset to solve the insufficient-data problem. A series of experiments shows that the hybrid system’s translation accuracy is improved, especially in out-of-domain translations, and classification accuracy is greatly improved when using the proposed features and the automatically constructed training set. A comparison between feature-and text-based classification is also performed, and the results show that the feature-based model achieves better classification accuracy, even when compared to neural network text classifiers.
Author supplied keywords
Cite
CITATION STYLE
Huang, J. X., Lee, K. S., & Kim, Y. K. (2020). Hybrid translation with classification: Revisiting rule-based and neural machine translation. Electronics (Switzerland), 9(2). https://doi.org/10.3390/electronics9020201
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.