The modern data analytics approaches suffer through the various challenges when dealing with large datasets, and thus, there is qualitative and quantitative degradation in the decision-making process. There is a need of an efficient analytical system that can handle on large and diverse datasets with increased accuracy in data analysis. Here, a prototype model of a cluster-based ensemble system is proposed to run on commodity machines using hierarchical approach. The commodity machines create the cluster-based ensemble in a distributed environment using a popular data mining algorithm. The experimentation proves that hierarchical approach helps to decrease time for data mining process, in turn boosts the speed of decision-making and the cluster-based ensemble improved the accuracy.
CITATION STYLE
Pole, G., & Gera, P. (2021). Cluster-Based Ensemble Using Distributed Clustering Approach for Large Categorical Data. In Lecture Notes in Networks and Systems (Vol. 154, pp. 671–680). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-15-8354-4_67
Mendeley helps you to discover research relevant for your work.