In this paper we examine methods to detect hate speech in social media, while distinguishing this from general profanity. We aim to establish lexical baselines for this task by applying supervised classification methods using a recently released dataset annotated for this purpose. As features, our system uses character n-grams, word n-grams and word skip-grams. We obtain results of 78% accuracy in identifying posts across three classes. Results demonstrate that the main challenge lies in discriminating profanity and hate speech from each other. A number of directions for future work are discussed.
CITATION STYLE
Malmasi, S., & Zampieri, M. (2017). Detecting hate speech in social media. In International Conference Recent Advances in Natural Language Processing, RANLP (Vol. 2017-September, pp. 467–472). Incoma Ltd. https://doi.org/10.26615/978-954-452-049-6_062
Mendeley helps you to discover research relevant for your work.