Identifying Slurs and Lexical Hate Speech via Light-Weight Dimension Projection in Embedding Space

1Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

The prevalence of hate speech on online platforms has become a pressing concern for society, leading to increased attention towards detecting hate speech. Prior work in this area has primarily focused on identifying hate speech at the utterance level that reflects the complex nature of hate speech. In this paper, we propose a targeted and efficient approach to identifying hate speech by detecting slurs at the lexical level using contextualized word embeddings. We hypothesize that slurs have a systematically different representation than their neutral counterparts, making them identifiable through existing methods for discovering semantic dimensions in word embeddings. The results demonstrate the effectiveness of our approach in predicting slurs, confirming linguistic theory that the meaning of slurs is stable across contexts. Our robust hate dimension approach for slur identification offers a promising solution to tackle a smaller yet crucial piece of the complex puzzle of hate speech detection.

Cite

CITATION STYLE

APA

Hoeken, S., Zarrieß, S., & Alaçam, Ö. (2023). Identifying Slurs and Lexical Hate Speech via Light-Weight Dimension Projection in Embedding Space. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 278–289). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.wassa-1.25

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free