Improving classification accuracy using clustering technique

24Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.

Abstract

Product classification is the key issue in e-commerce domains. Many products are released to the market rapidly and to select the correct category in taxonomy for each product has become a challenging task. The application of classification model is useful to precisely classify the products. The study proposed a method to apply clustering prior to classification. This study has used a large-scale real-world data set to identify the efficiency of clustering technique to improve the classification model. The conventional text classification procedures are used in the study such as preprocessing, feature extraction and feature selection before applying the clustering technique. Results show that clustering technique improves the accuracy of the classification model. The best classification model for all three approaches which are classification model only, classification with hierarchical clustering and classification with K-means clustering is K-Nearest Neighbor (KNN) model. Even though the accuracy of the KNN models are the same across different approaches but the KNN model with K-means clustering had the shortest time of execution. Hence, applying K-means clustering prior to KNN model helps in reducing the computation time.

Cite

CITATION STYLE

APA

Mathivanan, N. M. N., Nor, N. A., & Janor, R. M. (2018). Improving classification accuracy using clustering technique. Bulletin of Electrical Engineering and Informatics, 7(3), 465–470. https://doi.org/10.11591/eei.v7i3.1272

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free