Quality estimation for English-Hungarian machine translation systems with optimized semantic features

Zijian Győző Yang; László János Laki; Borbála Siklósi

Conference Proceedings

Quality estimation for English-Hungarian machine translation systems with optimized semantic features

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 9624 LNCS 88-100

DOI: 10.1007/978-3-319-75487-1_8

0Citations

3Readers

Get full text

Abstract

Quality estimation at run-time for machine translation systems is an important task. The standard automatic evaluation methods that use reference translations cannot evaluate MT results in real-time and the correlation between the results of these methods and that of human evaluation is very low in the case of translations from English to Hungarian. The new method to solve this problem is called quality estimation, which addresses the task by estimating the quality of translations as a prediction task for which features are extracted from the source and translated sentences only. In this study, we implement quality estimation for English-Hungarian. First, a corpus is created, which contains Hungarian human judgements. Using these human evaluation scores, different quality estimation models are described, evaluated and optimized. We created a corpus for English-Hungarian quality estimation and we developed 27 new semantic features using WordNet and word embedding models, then we created feature sets optimized for Hungarian, which produced better results than the baseline feature set.

Cite

CITATION STYLE

APA

Yang, Z. G., Laki, L. J., & Siklósi, B. (2018). Quality estimation for English-Hungarian machine translation systems with optimized semantic features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9624 LNCS, pp. 88–100). Springer Verlag. https://doi.org/10.1007/978-3-319-75487-1_8

Quality estimation for English-Hungarian machine translation systems with optimized semantic features

Abstract

Cite

Register to see more suggestions