Evaluation of machine translation performance across multiple genres and languages

Marlies Van Der Wees; Arianna Bisazza; Christof Monz

Conference Proceedings

Evaluation of machine translation performance across multiple genres and languages

LREC 2018 - 11th International Conference on Language Resources and Evaluation (2018) 3822-3827

11Citations

86Readers

Abstract

In this paper, we present evaluation corpora covering four genres for four language pairs that we harvested from the web in an automated fashion. We use these multi-genre benchmarks to evaluate the impact of genre differences on machine translation (MT). We observe that BLEU score differences between genres can be large and that, for all genres and all language pairs, translation quality improves when using four genre-optimized systems rather than a single genre-agnostic system. Finally, we train and use genre classifiers to route test documents to the most appropriate genre systems. The results of these experiments show that our multi-genre benchmarks can serve to advance research on text genre adaptation for MT.

Author supplied keywords

Cite

CITATION STYLE

APA

Van Der Wees, M., Bisazza, A., & Monz, C. (2018). Evaluation of machine translation performance across multiple genres and languages. In LREC 2018 - 11th International Conference on Language Resources and Evaluation (pp. 3822–3827). European Language Resources Association (ELRA).

Evaluation of machine translation performance across multiple genres and languages

Abstract

Author supplied keywords

Cite

Register to see more suggestions