Code-mixing refers to the phenomenon of using two or more languages interchangeably within a speech or discourse context. This practice is particularly prevalent on social media platforms, and determining the embedded affects in a code-mixed sentence remains as a challenging problem. In this submission we describe our system for WASSA 2023 Shared Task on Emotion Detection in English-Urdu code-mixed text. In our system we implement a multiclass emotion detection model with label space of 11 emotions. Samples are code-mixed English-Urdu text, where Urdu is written in romanised form. Our submission is limited to one of the subtasks - Multi Class classification and we leverage transformer-based Multilingual Large Language Models (MLLMs), XLM-RoBERTa and Indic-BERT. We fine-tune MLLMs on the released data splits, with and without pre-processing steps (translation to english), for classifying texts into the appropriate emotion category. Our methods did not surpass the baseline, and our submission is ranked sixth overall.
CITATION STYLE
Vedula, B. H., Kodali, P., Shrivastava, M., & Kumaraguru, P. (2023). PrecogIIITH@WASSA2023: Emotion Detection for Urdu-English Code-mixed Text. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 601–605). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.wassa-1.58
Mendeley helps you to discover research relevant for your work.