Estimating the number of clusters as a pre-processing step to unsupervised learning

2Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A great challenge in machine learning, as far as unsupervised algorithms are concerned, is to devise methods for pre-estimating the number of clusters associated to a given set of patterns to be clustered. By doing so and by using the number of clusters as input to clustering algorithms that require the information, the chances of getting better results increase substantially. The work described in this paper investigates the performance of an algorithm, based on the sequential clustering BSAS (Basic Sequential Algorithmic Scheme), to produce an ordered list (by frequency of occurrences), containing good estimates for the number of clusters in a given set of patterns. The BSAS is a convenient choice since the order in which patterns are presented to the algorithm can impact the induced clustering. The results of the experiments in eight sets of patterns can be considered empirical evidence that the procedure can be a practical and reliable option, as a pre-processing step, to using clustering algorithms that require the number of clusters.

Cite

CITATION STYLE

APA

Nietto, P. R., & do Carmo Nicoletti, M. (2017). Estimating the number of clusters as a pre-processing step to unsupervised learning. In Advances in Intelligent Systems and Computing (Vol. 557, pp. 25–34). Springer Verlag. https://doi.org/10.1007/978-3-319-53480-0_3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free