Cross-cultural transfer learning for text classification

7Citations
Citations of this article
105Readers
Mendeley users who have this article in their library.

Abstract

Large training datasets are required to achieve competitive performance in most natural language tasks. The acquisition process for these datasets is labor intensive, expensive, and time consuming. This process is also prone to human errors. In this work, we show that cross-cultural differences can be harnessed for natural language text classification. We present a transfer-learning framework that leverages widely-available unaligned bilingual corpora for classification tasks, using no task-specific data. Our empirical evaluation on two tasks - formality classification and sarcasm detection - shows that the cross-cultural difference between German and American English, as manifested in product review text, can be applied to achieve good performance for formality classification, while the difference between Japanese and American English can be applied to achieve good performance for sarcasm detection - both without any task-specific labeled data.

Cite

CITATION STYLE

APA

Ringel, D., Lavee, G., Guy, I., & Radinsky, K. (2019). Cross-cultural transfer learning for text classification. In EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference (pp. 3873–3883). Association for Computational Linguistics. https://doi.org/10.18653/v1/D19-1400

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free