In this paper, we describe our approach for the task of homophobia and transphobia detection in English social media comments. The dataset consists of YouTube comments, and it has been released for the shared task on Homophobia/Transphobia Detection in social media comments. Given the high class imbalance, we propose a solution based on data augmentation and ensemble modeling. We fine-tuned different large language models (BERT, RoBERTa, and HateBERT) and used the weighted majority vote on their predictions. Our proposed model obtained 0.48 and 0.94 for macro and weighted F1-score, respectively, ranking at the third position.
CITATION STYLE
Nozza, D. (2022). Nozza@LT-EDI-ACL2022: Ensemble Modeling for Homophobia and Transphobia Detection. In LTEDI 2022 - 2nd Workshop on Language Technology for Equality, Diversity and Inclusion, Proceedings of the Workshop (pp. 258–264). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.ltedi-1.37
Mendeley helps you to discover research relevant for your work.