iREL at SemEval-2023 Task 9: Improving understanding of multilingual Tweets using Translation-Based Augmentation and Domain Adapted Pre-Trained Models

1Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper describes our system (iREL) for Tweet intimacy analysis shared task of the Se-mEval 2023 workshop at ACL 2023. Our system achieved an overall Pearson’s r score of 0.5924 and ranked 10th on the overall leaderboard. For the unseen languages, we ranked third on the leaderboard and achieved a Pearson’s r score of 0.485. We used a single multilingual model for all languages, as discussed in this paper. We provide a detailed description of our pipeline along with multiple ablation experiments to further analyse each component of the pipeline. We demonstrate how translation-based augmentation, domain-specific features, and domain-adapted pretrained models improve the understanding of intimacy in tweets. The code can be found at https://github.com/bhavyajeet/Multilingual-tweet-intimacy.

Cite

CITATION STYLE

APA

Singh, B., Maity, A., Kandru, P., Hari, A., & Varma, V. (2023). iREL at SemEval-2023 Task 9: Improving understanding of multilingual Tweets using Translation-Based Augmentation and Domain Adapted Pre-Trained Models. In 17th International Workshop on Semantic Evaluation, SemEval 2023 - Proceedings of the Workshop (pp. 2052–2057). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.semeval-1.282

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free