A two-step feature selection method for quranic text classification


Abstract

Feature selection (FS) is an integral phase of text classification, applied primarily when preprocessing text data prior to labeling. Existing FS techniques have limitations, however: filter-based techniques tend to yield lower accuracy, while wrapper-based techniques are computationally expensive. This paper presents a two-step FS method. In the first step, a chi-square (CH) filter-based technique reduces the dimensionality of the feature set; in the second step, a wrapper correlation-based (CFS) technique selects the most relevant features from the reduced set. The aim is to reduce computational runtime while achieving high classification accuracy. The proposed method was then applied to labeling instances of the input data (Quranic verses) using standard classifiers: naïve Bayes (NB), support vector machine (SVM), and decision trees (J48). The proposed method achieved an accuracy of 93.6% with a runtime of 4.17 seconds.
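The two-step idea can be illustrated with a minimal, pure-Python sketch. It is not the authors' implementation: the abstract does not specify the correlation measure, the search strategy, or the toolkit used, so this sketch assumes binary term-presence features, a two-class 0/1 label, absolute Pearson correlation inside the standard CFS merit heuristic, and a greedy forward search. The function names (`chi2_score`, `cfs_merit`, `two_step_select`) and the toy data are hypothetical.

```python
from math import sqrt


def chi2_score(feature, labels):
    """Chi-square statistic between a binary feature and the class labels."""
    n = len(labels)
    score = 0.0
    for value in (0, 1):
        n_value = sum(1 for f in feature if f == value)
        for cls in set(labels):
            n_class = sum(1 for y in labels if y == cls)
            observed = sum(1 for f, y in zip(feature, labels)
                           if f == value and y == cls)
            expected = n_value * n_class / n
            if expected > 0:
                score += (observed - expected) ** 2 / expected
    return score


def pearson(a, b):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / sqrt(var_a * var_b)


def cfs_merit(subset, columns, labels):
    """CFS merit: k * r_cf / sqrt(k + k * (k - 1) * r_ff)."""
    k = len(subset)
    r_cf = sum(abs(pearson(columns[j], labels)) for j in subset) / k
    if k == 1:
        return r_cf
    pairs = [(a, b) for i, a in enumerate(subset) for b in subset[i + 1:]]
    r_ff = sum(abs(pearson(columns[a], columns[b])) for a, b in pairs) / len(pairs)
    return k * r_cf / sqrt(k + k * (k - 1) * r_ff)


def two_step_select(columns, labels, top_k):
    # Step 1 (filter): keep the top_k features ranked by chi-square score.
    ranked = sorted(range(len(columns)),
                    key=lambda j: chi2_score(columns[j], labels),
                    reverse=True)
    candidates = ranked[:top_k]

    # Step 2 (assumed greedy forward search): add the feature that most
    # improves the CFS merit; stop when no candidate improves it.
    selected, best = [], 0.0
    improved = True
    while improved:
        improved = False
        for j in candidates:
            if j in selected:
                continue
            merit = cfs_merit(selected + [j], columns, labels)
            if merit > best + 1e-12:
                best, choice, improved = merit, j, True
        if improved:
            selected.append(choice)
    return candidates, selected


# Toy data (hypothetical): 8 verses, 5 binary term features, 2 classes.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
columns = [
    [1, 1, 1, 1, 0, 0, 0, 0],  # feature 0: strongly class-related
    [1, 1, 1, 1, 0, 0, 0, 0],  # feature 1: duplicate of feature 0
    [1, 0, 1, 0, 1, 0, 1, 0],  # feature 2: unrelated to the class
    [1, 1, 1, 0, 0, 0, 0, 0],  # feature 3: partly class-related
    [0, 1, 0, 1, 0, 1, 0, 1],  # feature 4: unrelated to the class
]
candidates, selected = two_step_select(columns, labels, top_k=3)
print("after chi-square filter:", candidates)
print("after CFS step:", selected)
```

On this toy data the chi-square step discards the two class-independent features, and the correlation-based step then drops the duplicate and the partly redundant feature, since adding either does not raise the merit. The cheap filter runs over the full feature set, while the costlier subset search only sees the reduced set, which is the runtime argument the abstract makes.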

Cite

Adeleke, A., Samsudin, N. A., Othman, Z. A., & Ahmad Khalid, S. K. (2019). A two-step feature selection method for quranic text classification. Indonesian Journal of Electrical Engineering and Computer Science, 16(2), 730–736. https://doi.org/10.11591/ijeecs.v16.i2.pp730-736
