A novel stacked ensemble for hate speech recognition

Mona Khalifa A. Aljero; Nazife Dimililer

Journal Article (Open Access)

Applied Sciences (Switzerland) (2021) 11(24)

DOI: 10.3390/app112411684

Abstract

Detecting harmful content or hate speech on social media is a significant challenge because of the high throughput and large volume of content produced on these platforms. Identifying hate speech in a timely manner is crucial to preventing its dissemination. We propose a novel stacked ensemble approach for detecting hate speech in English tweets. The proposed architecture employs an ensemble of three base classifiers, namely support vector machine (SVM), logistic regression (LR), and XGBoost (XGB), trained using word2vec and universal encoding features. The meta classifier, LR, combines the outputs of the three base classifiers with the features employed by the base classifiers to produce the final output. We also present results on the use of various combinations of machine learning classifiers as base classifiers. Experimental results show that the proposed architecture improves performance on all four datasets compared with widely used single classifiers (the base classifiers), standard stacking, and a classifier ensemble using majority voting. Furthermore, on three of these datasets, the proposed architecture outperformed all state-of-the-art systems.
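The difference between the three combination schemes named in the abstract (majority voting, standard stacking, and the proposed stacking that also feeds the base classifiers' features to the meta classifier) can be illustrated with a minimal sketch of what each combiner receives for a single tweet. This is not the authors' implementation: the function names, the feature values, the use of probability-like scores as base outputs, and the concatenation order are all assumptions made for illustration.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(base_labels: Sequence[int]) -> int:
    """Baseline combiner: return the label chosen by most base classifiers."""
    return Counter(base_labels).most_common(1)[0][0]


def standard_stacking_input(base_outputs: Sequence[float]) -> List[float]:
    """Standard stacking: the meta classifier sees only the base outputs."""
    return list(base_outputs)


def proposed_stacking_input(
    base_outputs: Sequence[float], features: Sequence[float]
) -> List[float]:
    """Proposed variant: base outputs concatenated with the features
    the base classifiers were trained on (order is an assumption)."""
    return list(base_outputs) + list(features)


# Hypothetical values for one tweet (not taken from the paper).
features = [0.12, -0.40, 0.33, 0.08]   # stand-in for embedding features
base_outputs = [0.81, 0.35, 0.67]      # assumed scores from SVM, LR, XGB
base_labels = [int(score >= 0.5) for score in base_outputs]

vote = majority_vote(base_labels)
standard_input = standard_stacking_input(base_outputs)
proposed_input = proposed_stacking_input(base_outputs, features)

print("majority vote label:", vote)
print("standard stacking input size:", len(standard_input))
print("proposed stacking input size:", len(proposed_input))
```

In this sketch the meta classifier (LR in the abstract) would be trained on vectors shaped like `proposed_input`, so it can weigh the base classifiers' outputs against the original feature values instead of relying on the base outputs alone.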

Cite

Aljero, M. K. A., & Dimililer, N. (2021). A novel stacked ensemble for hate speech recognition. Applied Sciences (Switzerland), 11(24). https://doi.org/10.3390/app112411684
