A Weak Supervised Transfer Learning Approach for Sentiment Analysis to the Kuwaiti Dialect

Abstract

Developing a sentiment analysis system for Arabic is challenging because the available Arabic datasets are limited. Many Arabic dialects remain unstudied in Arabic sentiment analysis because recruiting annotators for dataset creation is difficult. This paper addresses the research gap in sentiment analysis for the Kuwaiti dialect by proposing a weakly supervised approach to building a large labeled dataset. Our dataset consists of over 16.6k tweets (7,905 negative, 7,902 positive, and 860 neutral) that span several themes and time frames to reduce bias in its content. Agreement between the labels produced by our proposed system and human-annotated labels is 93% in pairwise percent agreement and 0.87 in Cohen's kappa coefficient. Furthermore, we evaluate our dataset using multiple traditional machine learning classifiers and advanced deep learning language models. The ARBERT model achieves 89% accuracy on the testing dataset.
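The two agreement measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names `percent_agreement` and `cohens_kappa` are hypothetical, and the label lists are invented toy values, not data from the paper. Cohen's kappa corrects the observed agreement for the agreement expected by chance, given each annotator's label distribution.

```python
from collections import Counter


def percent_agreement(labels_a, labels_b):
    """Fraction of items on which the two label sequences agree."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return matches / len(labels_a)


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa: (p_o - p_e) / (1 - p_e) for two annotators."""
    n = len(labels_a)
    p_o = percent_agreement(labels_a, labels_b)
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    # Chance agreement: product of each annotator's marginal rate, summed over labels.
    p_e = sum(counts_a[lab] * counts_b[lab] for lab in set(counts_a) | set(counts_b)) / (n * n)
    if p_e == 1.0:
        # Both annotators used one identical label throughout; treat as full agreement.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Toy example (invented labels): system-assigned vs. human-assigned sentiment.
system_labels = ["pos", "pos", "neg", "neg", "neu", "pos", "neg", "neg"]
human_labels = ["pos", "pos", "neg", "neg", "neu", "neg", "neg", "neg"]

agreement = percent_agreement(system_labels, human_labels)
kappa = cohens_kappa(system_labels, human_labels)
print(f"percent agreement = {agreement:.3f}, kappa = {kappa:.3f}")
```

In this toy case the two sequences differ on one of eight items, so kappa comes out lower than the raw agreement, which is the usual pattern once chance agreement is subtracted.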

Husain, F., Al-Ostad, H., & Omar, H. (2022). A Weak Supervised Transfer Learning Approach for Sentiment Analysis to the Kuwaiti Dialect. In WANLP 2022 - 7th Arabic Natural Language Processing - Proceedings of the Workshop (pp. 161–173). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.wanlp-1.15
