On Machine Translation of User Reviews

3Citations
Citations of this article
42Readers
Mendeley users who have this article in their library.

Abstract

This work investigates neural machine translation (NMT) systems for translating English user reviews into Croatian and Serbian, two similar morphologically complex languages. Two types of reviews are used for testing the systems: IMDb movie reviews and Amazon product reviews. Two types of training data are explored: large out-of-domain bilingual parallel corpora, as well as small synthetic in-domain parallel corpus obtained by machine translation of monolingual English Amazon reviews into the target languages. Both automatic scores and human evaluation show that using the synthetic in-domain corpus together with a selected subset of out-of-domain data is the best option. Separated results on IMDb and Amazon reviews indicate that MT systems perform differently on different review types so that user reviews generally should not be considered as a homogeneous genre. Nevertheless, more detailed research on larger amount of different reviews covering different domains/topics is needed to fully understand these differences.

Cite

CITATION STYLE

APA

Popovic, M., Way, A., Poncelas, A., & Bakaric, M. B. (2021). On Machine Translation of User Reviews. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 1109–1118). Incoma Ltd. https://doi.org/10.26615/978-954-452-072-4_124

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free