HATE-ITA: Hate Speech Detection in Italian Social Media Text

Debora Nozza; Federico Bianchi; Giuseppe Attanasio

Conference ProceedingsOPEN ACCESS

HATE-ITA: Hate Speech Detection in Italian Social Media Text

WOAH 2022 - 6th Workshop on Online Abuse and Harms, Proceedings of the Workshop (2022) 252-260

DOI: 10.18653/v1/2022.woah-1.24

8Citations

28Readers

Abstract

Online hate speech is a dangerous phenomenon that can (and should) be promptly counteracted properly. While Natural Language Processing has been successfully used for the purpose, many of the research efforts are directed toward the English language. This choice severely limits the classification power in non-English languages. In this paper, we test several learning frameworks for identifying hate speech in Italian text. We release HATE-ITA, a set of multilanguage models trained on a large set of English data and available Italian datasets. HATE-ITA performs better than mono-lingual models and seems to adapt well also on language-specific slurs. We believe our findings will encourage research in other mid-to-low resource communities and provide a valuable benchmarking tool for the Italian community.

Cite

CITATION STYLE

APA

Nozza, D., Bianchi, F., & Attanasio, G. (2022). HATE-ITA: Hate Speech Detection in Italian Social Media Text. In WOAH 2022 - 6th Workshop on Online Abuse and Harms, Proceedings of the Workshop (pp. 252–260). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.woah-1.24

HATE-ITA: Hate Speech Detection in Italian Social Media Text

Abstract

Cite

Register to see more suggestions