The prevalence of hate speech on online platforms has become a pressing concern for society, leading to increased attention towards detecting hate speech. Prior work in this area has primarily focused on identifying hate speech at the utterance level that reflects the complex nature of hate speech. In this paper, we propose a targeted and efficient approach to identifying hate speech by detecting slurs at the lexical level using contextualized word embeddings. We hypothesize that slurs have a systematically different representation than their neutral counterparts, making them identifiable through existing methods for discovering semantic dimensions in word embeddings. The results demonstrate the effectiveness of our approach in predicting slurs, confirming linguistic theory that the meaning of slurs is stable across contexts. Our robust hate dimension approach for slur identification offers a promising solution to tackle a smaller yet crucial piece of the complex puzzle of hate speech detection.
CITATION STYLE
Hoeken, S., Zarrieß, S., & Alaçam, Ö. (2023). Identifying Slurs and Lexical Hate Speech via Light-Weight Dimension Projection in Embedding Space. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 278–289). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.wassa-1.25
Mendeley helps you to discover research relevant for your work.