Information density and quality estimation features as translationese indicators for human translation classification

Raphael Rubino; Ekaterina Lapshinova-Koltunski; Josef Van Genabith

Conference ProceedingsOPEN ACCESS

Information density and quality estimation features as translationese indicators for human translation classification

2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (2016) 960-970

DOI: 10.18653/v1/n16-1110

28Citations

91Readers

Abstract

This paper introduces information density and machine translation quality estimation inspired features to automatically detect and classify human translated texts. We investigate two settings: discriminating between translations and comparable originally authored texts, and distinguishing two levels of translation professionalism. Our framework is based on delexicalised sentence-level dense feature vector representations combined with a supervised machine learning approach. The results show state-of-the-art performance for mixed-domain translationese detection with information density and quality estimation based features, while results on translation expertise classification are mixed.

Cite

CITATION STYLE

APA

Rubino, R., Lapshinova-Koltunski, E., & Van Genabith, J. (2016). Information density and quality estimation features as translationese indicators for human translation classification. In 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (pp. 960–970). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n16-1110

Information density and quality estimation features as translationese indicators for human translation classification

Abstract

Cite

Register to see more suggestions