Identifying Slurs and Lexical Hate Speech via Light-Weight Dimension Projection in Embedding Space

Sanne Hoeken; Sina Zarrieß; Özge Alaçam

Conference Proceedings · Open Access

Proceedings of the Annual Meeting of the Association for Computational Linguistics (2023) 278-289

DOI: 10.18653/v1/2023.wassa-1.25

Abstract

The prevalence of hate speech on online platforms has become a pressing concern for society, leading to increased attention towards detecting hate speech. Prior work in this area has primarily focused on identifying hate speech at the utterance level, which reflects the complex nature of hate speech. In this paper, we propose a targeted and efficient approach to identifying hate speech by detecting slurs at the lexical level using contextualized word embeddings. We hypothesize that slurs have a systematically different representation than their neutral counterparts, making them identifiable through existing methods for discovering semantic dimensions in word embeddings. The results demonstrate the effectiveness of our approach in predicting slurs, confirming linguistic theory that the meaning of slurs is stable across contexts. Our robust hate dimension approach for slur identification offers a promising solution to tackle a smaller yet crucial piece of the complex puzzle of hate speech detection.
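
The abstract does not spell out how the hate dimension is built, so the following is only a minimal sketch of one common way to discover a semantic dimension in an embedding space: average the difference vectors between seed pairs of marked and neutral terms, then score a new word by projecting its embedding onto that direction. The function names, the toy 3-dimensional vectors, and the seed pairs are all assumptions made for illustration; the paper works with contextualized word embeddings, and its actual procedure may differ.

```python
import math

def hate_dimension(seed_pairs):
    """Average the (marked - neutral) difference vectors and scale to unit length.

    seed_pairs: list of (marked_vector, neutral_vector) tuples (hypothetical input).
    """
    dim = len(seed_pairs[0][0])
    mean_diff = [0.0] * dim
    for marked, neutral in seed_pairs:
        for i in range(dim):
            mean_diff[i] += (marked[i] - neutral[i]) / len(seed_pairs)
    norm = math.sqrt(sum(x * x for x in mean_diff))
    return [x / norm for x in mean_diff]

def projection_score(vector, direction):
    """Scalar projection of an embedding onto the unit-length direction."""
    return sum(v * d for v, d in zip(vector, direction))

# Toy vectors invented for illustration; real inputs would be embeddings
# extracted from a language model.
seed_pairs = [
    ([0.9, 0.1, 0.3], [0.1, 0.1, 0.3]),
    ([0.8, 0.4, 0.2], [0.2, 0.4, 0.2]),
]
direction = hate_dimension(seed_pairs)

candidate = [0.7, 0.2, 0.5]  # stands in for an unseen, possibly marked term
control = [0.1, 0.2, 0.5]    # stands in for a neutral term
print(projection_score(candidate, direction) > projection_score(control, direction))
```

In this sketch a higher projection score means the word lies further along the assumed direction. Turning scores into slur predictions would additionally require a decision threshold or a classifier, which is not shown here.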

Cite

Hoeken, S., Zarrieß, S., & Alaçam, Ö. (2023). Identifying Slurs and Lexical Hate Speech via Light-Weight Dimension Projection in Embedding Space. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 278–289). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.wassa-1.25
