Towards Weakly-Supervised Hate Speech Classification Across Datasets

Yiping Jin; Leo Wanner; Vishakha Laxman Kadam; Alexander Shvets

Conference Proceedings

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2023) 42-59

DOI: 10.18653/v1/2023.woah-1.4

Abstract

As several scholars have pointed out, current research on hate speech (HS) recognition is characterized by unsystematic data creation strategies and diverging annotation schemata. As a result, supervised-learning models tend to generalize poorly to datasets they were not trained on, and the performance of models trained on datasets labeled with different HS taxonomies cannot be compared. To ease this problem, we propose to apply extremely weak supervision that relies only on the class name rather than on class samples from the annotated data. We demonstrate the effectiveness of a state-of-the-art weakly-supervised text classification model in various in-dataset and cross-dataset settings. Furthermore, we conduct an in-depth quantitative and qualitative analysis of the sources of poor generalizability of HS classification models.

Cite

Jin, Y., Wanner, L., Kadam, V. L., & Shvets, A. (2023). Towards Weakly-Supervised Hate Speech Classification Across Datasets. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 42–59). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.woah-1.4