Estimating the number of clusters as a pre-processing step to unsupervised learning

Paulo Rogerio Nietto; Maria do Carmo Nicoletti

Conference Proceedings

Estimating the number of clusters as a pre-processing step to unsupervised learning

Advances in Intelligent Systems and Computing (2017) 557 25-34

DOI: 10.1007/978-3-319-53480-0_3

2Citations

1Readers

Get full text

Abstract

A great challenge in machine learning, as far as unsupervised algorithms are concerned, is to devise methods for pre-estimating the number of clusters associated to a given set of patterns to be clustered. By doing so and by using the number of clusters as input to clustering algorithms that require the information, the chances of getting better results increase substantially. The work described in this paper investigates the performance of an algorithm, based on the sequential clustering BSAS (Basic Sequential Algorithmic Scheme), to produce an ordered list (by frequency of occurrences), containing good estimates for the number of clusters in a given set of patterns. The BSAS is a convenient choice since the order in which patterns are presented to the algorithm can impact the induced clustering. The results of the experiments in eight sets of patterns can be considered empirical evidence that the procedure can be a practical and reliable option, as a pre-processing step, to using clustering algorithms that require the number of clusters.

Author supplied keywords

Cite

CITATION STYLE

APA

Nietto, P. R., & do Carmo Nicoletti, M. (2017). Estimating the number of clusters as a pre-processing step to unsupervised learning. In Advances in Intelligent Systems and Computing (Vol. 557, pp. 25–34). Springer Verlag. https://doi.org/10.1007/978-3-319-53480-0_3

Estimating the number of clusters as a pre-processing step to unsupervised learning

Abstract

Author supplied keywords

Cite

Register to see more suggestions