Abstract
Nowadays, intimacy is a fundamental aspect of how we relate to other people in social settings. The most frequent way in which we can determine a high level of intimacy is in the use of certain emoticons, curse words, verbs, etc. This paper presents the approach developed to solve SemEval 2023 task 9: Multiligual Tweet Intimacy Analysis. To address the task, a transfer-learning approach was conducted by fine-tuning various pre-trained language models. Since the dataset supplied by the organizer was highly imbalanced, our main strategy to obtain high prediction values was the implementation of round-trip translation for oversampling and a random approach for undersampling on the training set. Our final submission achieved an overall Pearson’s r of 0.497.
Cite
CITATION STYLE
Estévez, A. P., Vázquez, J. M., Álvarez, V. P., & El Balima Cordero, N. (2023). I2C-Huelva at SemEval-2023 Task 9: Analysis of Intimacy in Multilingual Tweets Using Resampling Methods and Transformers. In 17th International Workshop on Semantic Evaluation, SemEval 2023 - Proceedings of the Workshop (pp. 758–762). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.semeval-1.104
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.