Abstract
In this paper, we present our findings on the two subtasks of the 2022 NADI shared task. First, in the Arabic dialect identification subtask, we find heavy class imbalance and propose to address it using focal loss. Our experiments with the focusing hyperparameter confirm that focal loss improves performance. Second, in the Arabic tweet sentiment analysis subtask, we deal with a smaller dataset whose text includes both Arabic dialects and Modern Standard Arabic (MSA). We propose to use transfer learning from both pre-trained MSA language models and our own model from the first subtask. Our system ranks 5th and 7th on the leaderboards of the first and second subtasks, respectively.
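For readers unfamiliar with focal loss, the following is a minimal sketch of its standard form, FL(p_t) = -(1 - p_t)^gamma * log(p_t), where p_t is the predicted probability of the true class and gamma is the focusing hyperparameter. It is an illustration only, not the authors' implementation: the function names, the example logits, and the value gamma = 2.0 are assumptions, and the optional class-weighting term is omitted.

```python
import math
from typing import Sequence


def softmax(logits: Sequence[float]) -> list[float]:
    """Numerically stable softmax over a list of raw class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits: Sequence[float], target: int, gamma: float = 2.0) -> float:
    """Standard focal loss for one example (illustrative, hypothetical helper).

    gamma is the focusing hyperparameter; gamma = 0 recovers plain
    cross-entropy. The default of 2.0 is an assumption, not a value
    taken from the paper.
    """
    p_t = softmax(logits)[target]
    # (1 - p_t) ** gamma shrinks the loss of well-classified examples,
    # so hard (often minority-class) examples dominate the gradient.
    return -((1.0 - p_t) ** gamma) * math.log(p_t)


if __name__ == "__main__":
    logits = [4.0, 0.0, 0.0]  # made-up scores for a 3-class problem
    for gamma in (0.0, 1.0, 2.0):
        easy = focal_loss(logits, target=0, gamma=gamma)  # confident, correct
        hard = focal_loss(logits, target=1, gamma=gamma)  # low-probability class
        print(f"gamma={gamma}: easy={easy:.5f} hard={hard:.5f}")
```

With gamma = 0 the function reduces to ordinary cross-entropy; increasing gamma reduces the contribution of confidently classified examples far more than that of misclassified ones, which is why the technique is commonly used under class imbalance.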
Cite
El-Shangiti, A. O., & Mrini, K. (2022). Ahmed and Khalil at NADI 2022: Transfer Learning and Addressing Class Imbalance for Arabic Dialect Identification and Sentiment Analysis. In WANLP 2022 - 7th Arabic Natural Language Processing - Proceedings of the Workshop (pp. 442–446). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.wanlp-1.46