Feature selection problem is one of the main important problems in the text and data mining domain. This paper presents a comparative study of feature selection methods for Arabic text classification. Five of the feature selection methods were selected: ICHI square, CHI square, Information Gain, Mutual Information and Wrapper. It was tested with five classification algorithms: Bayes Net, Naive Bayes, Random Forest, Decision Tree and Artificial Neural Networks. In addition, Data Collection was used in Arabic consisting of 9055 documents, which were compared by four criteria: Precision, Recall, F-measure and Time to build model. The results showed that the improved ICHI feature selection got almost all the best results in comparison with other methods.
CITATION STYLE
Alshaer, H. N., Otair, M. A., & Abualigah, L. (2020). Improved ICHI square feature selection method for Arabic classifiers. International Journal of Informatics and Communication Technology (IJ-ICT), 9(3), 157. https://doi.org/10.11591/ijict.v9i3.pp157-170
Mendeley helps you to discover research relevant for your work.