Applying machine learning techniques for religious extremism detection on online user contents

13Citations
Citations of this article
22Readers
Mendeley users who have this article in their library.

Abstract

In this research paper, we propose a corpus for the task of detecting religious extremism in social networks and open sources and compare various machine learning algorithms for the binary classification problem using a previously created corpus, thereby checking whether it is possible to detect extremist messages in the Kazakh language. To do this, the authors trained models using six classic machine-learning algorithms such as Support Vector Machine, Decision Tree, Random Forest, K Nearest Neighbors, Naive Bayes, and Logistic Regression. To increase the accuracy of detecting extremist texts, we used various characteristics such as Statistical Features, TF-IDF, POS, LIWC, and applied oversampling and undersampling techniques to handle imbalanced data. As a result, we achieved 98% accuracy in detecting religious extremism in Kazakh texts for the collected dataset. Testing the developed machine learning models in various databases that are often found in everyday life “Jokes”, “News”, “Toxic content”, “Spam”, “Advertising” has also shown high rates of extremism detection.

Cite

CITATION STYLE

APA

Mussiraliyeva, S., Omarov, B., Yoo, P., & Bolatbek, M. (2021). Applying machine learning techniques for religious extremism detection on online user contents. Computers, Materials and Continua, 70(1), 915–934. https://doi.org/10.32604/cmc.2022.019189

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free