Abstract
The massive expansion of social networking websites has made it easier for people with various cultural and psychological backgrounds to communicate directly with one another. It has led to an increase in online conflicts between them. This paper proposes an approach to detect hate speech. A publicly available dataset of tweets in language is used. Data preprocessing includes the removal of stopwords , punctuations , emojis , numbers , URLs etc. and feature extraction is carried out using tokenization , lemmatization and POS tagging. The performances of XGBoost, Random Forest , Logistic Regression and SVM have been compared in this study for the detection of hate speech. XGBoost classifier provided highest accuracy of 74.93%. Keywords: Hate speech, Hate tweets, Machine learning.
Cite
CITATION STYLE
Nikhil, N. (2024). Hate Speech Detection. INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT, 08(05), 1–5. https://doi.org/10.55041/ijsrem34783
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.