In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained sentence-level transformer models and compared different document-level translation approaches. For Finnish-English and English-Finnish, we focused on different segmentation approaches, and we also included a rule-based system for English-Finnish.
Talman, A., Sulubacak, U., Vázquez, R., Scherrer, Y., Virpioja, S., Raganato, A., … Tiedemann, J. (2019). The University of Helsinki submissions to the WMT19 news translation task. In WMT 2019 - 4th Conference on Machine Translation, Proceedings of the Conference (Vol. 2, pp. 412–423). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-5347