A parallel corpus of translationese

Ella Rabinovich; Shuly Wintner; Ofek Luis Lewinsohn

Conference Proceedings

A parallel corpus of translationese

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 9624 LNCS 140-155

DOI: 10.1007/978-3-319-75487-1_12

2Citations

17Readers

Get full text

Abstract

We describe a set of bilingual English-French and English-German parallel corpora in which the direction of translation is accurately and reliably annotated. The corpora are diverse, consisting of parliamentary proceedings, literary works, transcriptions of TED talks and political commentary. They will be instrumental for research of translationese and its applications to (human and machine) translation; specifically, they can be used for the task of translationese identification, a research direction that enjoys a growing interest in recent years. To validate the quality and reliability of the corpora, we replicated previous results of supervised and unsupervised identification of translationese, and further extended the experiments to additional datasets and languages.

Author supplied keywords

Cite

CITATION STYLE

APA

Rabinovich, E., Wintner, S., & Lewinsohn, O. L. (2018). A parallel corpus of translationese. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9624 LNCS, pp. 140–155). Springer Verlag. https://doi.org/10.1007/978-3-319-75487-1_12

A parallel corpus of translationese

Abstract

Author supplied keywords

Cite

Register to see more suggestions