This paper introduces the first emotion-annotated dataset for the Dari variant of Persian spoken in Afghanistan. The LetHerLearn dataset contains 7,600 tweets posted in reaction to the Taliban’s ban of women’s rights to education in 2022 and has been manually annotated according to Ekman’s emotion categories. We here detail the data collection and annotation process, present relevant dataset statistics as well as initial experiments on the resulting dataset, benchmarking a number of different neural architectures for the task of Dari emotion classification.
CITATION STYLE
Hussiny, M. A., & Øvrelid, L. (2023). Emotion Analysis of Tweets Banning Education in Afghanistan. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 271–277). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.wassa-1.24
Mendeley helps you to discover research relevant for your work.