Developing A Parallel Classifier For Mining In Big Data Sets

Ahad Shamseen; Morteza Mohammadi Zanjireh; Mahdi Bahaghighat; Qin Xin

Journal Article · Open Access

IIUM Engineering Journal (2021) 22(2) 119-134

DOI: 10.31436/iiumej.v22i2.1541

3 Citations

9 Readers

Abstract

Data mining is the extraction of information and its roles from vast amounts of data, and it is one of the most important topics today. Massive amounts of data are generated and stored each day, and this data contains useful information across many fields, which attracts the attention of programmers and engineers. The decision tree is one of the primary classification algorithms in data mining. Decision tree techniques have several advantages but also drawbacks; one of the main drawbacks is that the data must reside in main memory. SPRINT is a decision tree classifier that proposed a fix for this problem. In this paper, we develop a new parallel decision tree classifier that builds on the SPRINT results. Our experimental results show considerable improvements in runtime and memory requirements compared with the SPRINT classifier. The proposed classifier can be implemented in both serial and parallel environments and can handle big data.

Cite

APA

Shamseen, A., Zanjireh, M. M., Bahaghighat, M., & Xin, Q. (2021). Developing A Parallel Classifier For Mining In Big Data Sets. IIUM Engineering Journal, 22(2), 119–134. https://doi.org/10.31436/iiumej.v22i2.1541
