Building Neural Machine Translation Systems for Multilingual Participatory Spaces

Pintu Lohar; Guodong Xie; Daniel Gallagher; Andy Way

Journal ArticleOPEN ACCESS

Building Neural Machine Translation Systems for Multilingual Participatory Spaces

Lohar P
Xie G
Gallagher D
et al.

Analytics (2023) 2(2) 393-409

DOI: 10.3390/analytics2020022

N/ACitations

5Readers

Abstract

This work presents the development of the translation component in a multistage, multilevel, multimode, multilingual and dynamic deliberative (M4D2) system, built to facilitate automated moderation and translation in the languages of five European countries: Italy, Ireland, Germany, France and Poland. Two main topics were to be addressed in the deliberation process: (i) the environment and climate change; and (ii) the economy and inequality. In this work, we describe the development of neural machine translation (NMT) models for these domains for six European languages: Italian, English (included as the second official language of Ireland), Irish, German, French and Polish. As a result, we generate 30 NMT models, initially baseline systems built using freely available online data, which are then adapted to the domains of interest in the project by (i) filtering the corpora, (ii) tuning the systems with automatically extracted in-domain development datasets and (iii) using corpus concatenation techniques to expand the amount of data available. We compare our results produced by the domain-adapted systems with those produced by Google Translate, and demonstrate that fast, high-quality systems can be produced that facilitate multilingual deliberation in a secure environment.

Cite

CITATION STYLE

APA

Lohar, P., Xie, G., Gallagher, D., & Way, A. (2023). Building Neural Machine Translation Systems for Multilingual Participatory Spaces. Analytics, 2(2), 393–409. https://doi.org/10.3390/analytics2020022

Building Neural Machine Translation Systems for Multilingual Participatory Spaces

Abstract

Cite

Register to see more suggestions