UMUTeam@LT-EDI-ACL2022: Detecting homophobic and transphobic comments in Tamil

12Citations
Citations of this article
35Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This working-notes are about the participation of the UMUTeam in a LT-EDI shared task concerning the identification of homophobic and transphobic comments in YouTube. These comments are written in English, which has high availability to machine-learning resources; Tamil, which has fewer resources; and a transliteration from Tamil to Roman script combined with English sentences. To carry out this shared task, we train a neural network that combines several feature sets applying a knowledge integration strategy. These features are linguistic features extracted from a tool developed by our research group and contextual and non-contextual sentence embeddings. We ranked 7th for English subtask (macro f1-score of 45%), 3rd for Tamil subtask (macro f1-score of 82%), and 2nd for Tamil-English subtask (macro f1-score of 58%).

Cite

CITATION STYLE

APA

García-Díaz, J. A., Caparrós-Laiz, C., & Valencia-García, R. (2022). UMUTeam@LT-EDI-ACL2022: Detecting homophobic and transphobic comments in Tamil. In LTEDI 2022 - 2nd Workshop on Language Technology for Equality, Diversity and Inclusion, Proceedings of the Workshop (pp. 140–144). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.ltedi-1.16

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free