Abstract
This paper presents a translationese study based on the parallel data from the Russian National Corpus (RNC). We explored differences between literary texts originally authored in Russian and fiction translated into Russian from 11 languages. The texts are represented with frequency-based features that capture structural and lexical properties of language. Binary classification results indicate that literary translations can be distinguished from non-translations with an accuracy ranging from 82 to 92% depending on the source language and feature set. Multiclass classification confirms that translations from distant languages are more distinct from non-translations than translations from languages that are typologically close to Russian. It also demonstrates that translations from same-family source languages share translationese properties. Structural features return more consistent results than features relying on external resources and capturing lexical properties of texts in both translationese detection and source language identification tasks.
Cite
CITATION STYLE
Kunilovskaya, M., Lapshinova-Koltunski, E., & Mitkov, R. (2021). Fiction in Russian Translation: A Translationese Study. In International Conference Recent Advances in Natural Language Processing, RANLP (pp. 734–743). Incoma Ltd. https://doi.org/10.26615/978-954-452-072-4_084
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.