Sentiment Analysis of Tunisian Dialect: Linguistic Resources and Experiments


Abstract

Dialectal Arabic (DA) is significantly different from the Arabic language taught in schools and used in written communication and formal speech (broadcast news, religion, politics, etc.). There is a large body of research on Sentiment Analysis (SA) for Arabic; however, it is generally restricted to Modern Standard Arabic (MSA) or to a few dialects of economic or political interest. In this paper we focus on SA of the Tunisian dialect. We use Machine Learning techniques to determine the polarity of comments written in the Tunisian dialect. First, we evaluate the performance of SA systems with models trained on freely available MSA and multi-dialectal data sets. We then collect and annotate a Tunisian dialect corpus of 17,000 comments from Facebook. Models trained on this corpus show a significant improvement over the best model trained on other Arabic dialects or MSA data. We believe that this first freely available corpus will be valuable to researchers working in the field of Tunisian Sentiment Analysis and similar areas.

Cite

Mdhaffar, S., Bougares, F., Estève, Y., & Hadrich-Belguith, L. (2017). Sentiment Analysis of Tunisian Dialect: Linguistic Resources and Experiments. In WANLP 2017, co-located with EACL 2017 - 3rd Arabic Natural Language Processing Workshop, Proceedings of the Workshop (pp. 55–61). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w17-1307
