Improved generalization of arabic text classifiers

10Citations
Citations of this article
84Readers
Mendeley users who have this article in their library.

Abstract

While transfer learning for text has been very active in the English language, progress in Arabic has been slow, including the use of Domain Adaptation (DA). Domain Adaptation is used to generalize the performance of any classifier by trying to balance the classifier's accuracy for a particular task among different text domains. In this paper, we propose and evaluate two variants of a domain adaptation technique: The first is a base model called Domain Adversarial Neural Network (DANN), while the second is a variation that incorporates representational learning. Similar to previous approaches, we propose the use of proxy A-distance as a metric to assess the success of generalization. We make use of ArSentDLEV, a multi-topic dataset collected from the Levantine countries, to test the performance of the models. We show the superiority of the proposed method in accuracy and robustness when dealing with the Arabic language.

Cite

CITATION STYLE

APA

Khaddaj, A., Hajj, H., & El-Hajj, W. (2019). Improved generalization of arabic text classifiers. In ACL 2019 - 4th Arabic Natural Language Processing Workshop, WANLP 2019 - Proceedings of the Workshop (pp. 167–174). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-4618

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free