Towards Weakly-Supervised Hate Speech Classification Across Datasets

2Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.
Get full text

Abstract

As pointed out by several scholars, current research on hate speech (HS) recognition is characterized by unsystematic data creation strategies and diverging annotation schemata. Subsequently, supervised-learning models tend to generalize poorly to datasets they were not trained on, and the performance of the models trained on datasets labeled using different HS taxonomies cannot be compared. To ease this problem, we propose to apply extremely weak supervision that only relies on the class name rather than on class samples from the annotated data. We demonstrate the effectiveness of a state-of-the-art weakly-supervised text classification model in various in-dataset and cross-dataset settings. Furthermore, we conduct an in-depth quantitative and qualitative analysis of the source of poor generalizability of HS classification models.

Cite

CITATION STYLE

APA

Jin, Y., Wanner, L., Kadam, V. L., & Shvets, A. (2023). Towards Weakly-Supervised Hate Speech Classification Across Datasets. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 42–59). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.woah-1.4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free