Effects of Feature Extraction and Classification Methods on Cyberbully Detection

  • SARAÇ E
  • ÖZEL S

Abstract

Cyberbullying is defined as an aggressive, intentional act carried out against a defenseless person through the Internet or other electronic means. Researchers have found that many bullying cases have tragically ended in suicide; hence, automatic detection of cyberbullying has become important. In this study, we show how the choice of feature extraction, feature selection, and classification methods affects the performance of automatic cyberbullying detection. The experiments use the FormSpring.me dataset and investigate the effects of preprocessing methods; of several classifiers, namely C4.5, Naïve Bayes, kNN, and SVM; and of the information gain and chi-square feature selection methods. Experimental results indicate that the best classification results are obtained with alphabetic tokenization, no stemming, and no stopword removal. Feature selection also improves cyberbullying detection performance. Among the classifiers compared, C4.5 performs best on this dataset.
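The preprocessing and feature selection steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the binary "term present" feature, and the four toy posts are assumptions made for illustration only. It shows alphabetic tokenization without stemming or stopword removal, followed by ranking terms by information gain.

```python
import math
import re
from collections import Counter


def alphabetic_tokens(text):
    """Lowercase the text and keep runs of letters only.

    No stemming and no stopword removal are applied, which mirrors the
    preprocessing setting the abstract reports as working best.
    """
    return re.findall(r"[a-z]+", text.lower())


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(docs, labels, term):
    """Information gain of a binary 'term present' feature (an assumption)."""
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    conditional = sum(
        len(part) / n * entropy(part)
        for part in (with_term, without_term)
        if part
    )
    return entropy(labels) - conditional


def select_top_terms(texts, labels, k):
    """Return the k terms with the highest information gain."""
    docs = [set(alphabetic_tokens(t)) for t in texts]
    vocab = sorted(set().union(*docs))
    scored = [(information_gain(docs, labels, t), t) for t in vocab]
    # Sort by descending gain, breaking ties alphabetically.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [term for _, term in scored[:k]]


# Hypothetical toy posts (1 = bullying, 0 = not bullying); not from FormSpring.me.
texts = [
    "You are so stupid!!! 123",
    "nobody likes you, stupid",
    "What is your favorite movie?",
    "do you like pizza",
]
labels = [1, 1, 0, 0]

if __name__ == "__main__":
    print(select_top_terms(texts, labels, 3))
```

The selected terms would then be used as the feature set for a classifier such as C4.5, Naïve Bayes, kNN, or SVM; chi-square scoring could be substituted for `information_gain` in the same ranking loop.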

Cite

SARAÇ, E., & ÖZEL, S. A. (2016). Effects of Feature Extraction and Classification Methods on Cyberbully Detection. Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 21(1), 190. https://doi.org/10.19113/sdufbed.20964
