This paper introduces information density and machine translation quality estimation inspired features to automatically detect and classify human translated texts. We investigate two settings: discriminating between translations and comparable originally authored texts, and distinguishing two levels of translation professionalism. Our framework is based on delexicalised sentence-level dense feature vector representations combined with a supervised machine learning approach. The results show state-of-the-art performance for mixed-domain translationese detection with information density and quality estimation based features, while results on translation expertise classification are mixed.
CITATION STYLE
Rubino, R., Lapshinova-Koltunski, E., & Van Genabith, J. (2016). Information density and quality estimation features as translationese indicators for human translation classification. In 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (pp. 960–970). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n16-1110
Mendeley helps you to discover research relevant for your work.