Measuring constraint-set utility for partitional clustering algorithms

Ian Davidson; Kiri L. Wagstaff; Sugato Basu

Conference Proceedings (Open Access)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4213 LNAI 115-126

DOI: 10.1007/11871637_15

Abstract

Clustering with constraints is an active area of machine learning and data mining research. Previous empirical work has convincingly shown that adding constraints to clustering improves performance, with respect to the true data labels. However, in most of these experiments, results are averaged over different randomly chosen constraint sets, thereby masking interesting properties of individual sets. We demonstrate that constraint sets vary significantly in how useful they are for constrained clustering; some constraint sets can actually decrease algorithm performance. We create two quantitative measures, informativeness and coherence, that can be used to identify useful constraint sets. We show that these measures can also help explain differences in performance for four particular constrained clustering algorithms. © Springer-Verlag Berlin Heidelberg 2006.

Citation (APA)

Davidson, I., Wagstaff, K. L., & Basu, S. (2006). Measuring constraint-set utility for partitional clustering algorithms. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4213 LNAI, pp. 115–126). Springer-Verlag. https://doi.org/10.1007/11871637_15
