Emotion Classification in a Resource Constrained Language Using Transformer-based Approach

25Citations
Citations of this article
73Readers
Mendeley users who have this article in their library.

Abstract

Although research on emotion classification has significantly progressed in high-resource languages, it is still infancy for resource-constrained languages like Bengali. However, unavailability of necessary language processing tools and deficiency of benchmark corpora makes the emotion classification task in Bengali more challenging and complicated. This work proposes a transformer-based technique to classify the Bengali text into one of the six basic emotions: anger, fear, disgust, sadness, joy, and surprise. A Bengali emotion corpus consists of 6243 texts is developed for the classification task. Experimentation carried out using various machine learning (LR, RF, MNB, SVM), deep neural networks (CNN, BiLSTM, CNN+BiLSTM) and transformer (Bangla-BERT, m-BERT, XLM-R) based approaches. Experimental outcomes indicate that XLM-R outdoes all other techniques by achieving the highest weighted f1-score of 69.73% on the test data. The dataset is publicly available at https://github.com/omar-sharif03/ NAACL-SRW-2021.

Cite

CITATION STYLE

APA

Das, A., Sharif, O., Hoque, M. M., & Sarker, I. H. (2021). Emotion Classification in a Resource Constrained Language Using Transformer-based Approach. In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Student Research Workshop (pp. 150–158). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-srw.19

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free