Cluster-Based Ensemble Using Distributed Clustering Approach for Large Categorical Data

Govind Pole; Pradeepini Gera

Conference Proceedings

Cluster-Based Ensemble Using Distributed Clustering Approach for Large Categorical Data

Lecture Notes in Networks and Systems (2021) 154 671-680

DOI: 10.1007/978-981-15-8354-4_67

1Citations

3Readers

Get full text

Abstract

The modern data analytics approaches suffer through the various challenges when dealing with large datasets, and thus, there is qualitative and quantitative degradation in the decision-making process. There is a need of an efficient analytical system that can handle on large and diverse datasets with increased accuracy in data analysis. Here, a prototype model of a cluster-based ensemble system is proposed to run on commodity machines using hierarchical approach. The commodity machines create the cluster-based ensemble in a distributed environment using a popular data mining algorithm. The experimentation proves that hierarchical approach helps to decrease time for data mining process, in turn boosts the speed of decision-making and the cluster-based ensemble improved the accuracy.

Author supplied keywords

Cite

CITATION STYLE

APA

Pole, G., & Gera, P. (2021). Cluster-Based Ensemble Using Distributed Clustering Approach for Large Categorical Data. In Lecture Notes in Networks and Systems (Vol. 154, pp. 671–680). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-15-8354-4_67

Cluster-Based Ensemble Using Distributed Clustering Approach for Large Categorical Data

Abstract

Author supplied keywords

Cite

Register to see more suggestions