Various approaches to produce cluster ensembles and several consensus functions to combine data partitions have been proposed in order to obtain a more robust partition of the data. However, the existence of many approaches leads to another problem which consists in knowing which of these approaches to produce the cluster ensembles' data and to combine these partitions best fits a given data set. In this paper, we propose a new measure to select the best consensus data partition, among a variety of consensus partitions, based on the concept of average cluster consistency between each data partition that belongs to the cluster ensemble and a given consensus partition. The experimental results obtained by comparing this measure with other measures for cluster ensemble selection in 9 data sets, showed that the partitions selected by our measure generally were of superior quality in comparison with the consensus partitions selected by other measures. © 2011 Springer-Verlag.
CITATION STYLE
Duarte, F. J. F., Duarte, J. M. M., Fred, A. L. N., & Rodrigues, M. F. C. (2011). Average cluster consistency for cluster ensemble selection. In Communications in Computer and Information Science (Vol. 128 CCIS, pp. 133–148). https://doi.org/10.1007/978-3-642-19032-2_10
Mendeley helps you to discover research relevant for your work.