Clustering large data set: An applied comparative study

Laura Bocci; Isabella Mingo

Book Chapter

Clustering large data set: An applied comparative study

Springer International Publishing, (2012), 3-12

DOI: 10.1007/978-3-642-21037-2_1

0Citations

3Readers

Get full text

Abstract

The aim of this paper is to analyze different strategies to cluster large data sets derived from social context. For the purpose of clustering, trials on effective and efficient methods for large databases have only been carried out in recent years due to the emergence of the field of data mining. In this paper a sequential approach based on multiobjective genetic algorithm as clustering technique is proposed. The proposed strategy is applied to a real-life data set consisting of approximately 1.5 million workers and the results are compared with those obtained by other methods to find out an unambiguous partitioning of data.

Cite

CITATION STYLE

APA

Bocci, L., & Mingo, I. (2012). Clustering large data set: An applied comparative study. In Studies in Theoretical and Applied Statistics, Selected Papers of the Statistical Societies (pp. 3–12). Springer International Publishing. https://doi.org/10.1007/978-3-642-21037-2_1

Clustering large data set: An applied comparative study

Abstract

Cite

Register to see more suggestions