PrecogIIITH@WASSA2023: Emotion Detection for Urdu-English Code-mixed Text

2Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

Code-mixing refers to the phenomenon of using two or more languages interchangeably within a speech or discourse context. This practice is particularly prevalent on social media platforms, and determining the embedded affects in a code-mixed sentence remains as a challenging problem. In this submission we describe our system for WASSA 2023 Shared Task on Emotion Detection in English-Urdu code-mixed text. In our system we implement a multiclass emotion detection model with label space of 11 emotions. Samples are code-mixed English-Urdu text, where Urdu is written in romanised form. Our submission is limited to one of the subtasks - Multi Class classification and we leverage transformer-based Multilingual Large Language Models (MLLMs), XLM-RoBERTa and Indic-BERT. We fine-tune MLLMs on the released data splits, with and without pre-processing steps (translation to english), for classifying texts into the appropriate emotion category. Our methods did not surpass the baseline, and our submission is ranked sixth overall.

Cite

CITATION STYLE

APA

Vedula, B. H., Kodali, P., Shrivastava, M., & Kumaraguru, P. (2023). PrecogIIITH@WASSA2023: Emotion Detection for Urdu-English Code-mixed Text. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 601–605). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.wassa-1.58

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free