The concept of cluster stability is introduced to assess the validity of data partitionings found by clustering algorithms. It allows us to explicitly quantify the quality of a clustering solution, without being dependent on external information. The principle of maximizing the cluster stability can be interpreted as choosing the most self-consistent data partitioning. We present an empirical estimator for the theoretically derived stability index, based on resampling. Experiments are conducted on well known gene expression data sets, re-analyzing the work by Alon et al. [1] and by Spellman et al. [8]. © Springer-Verlag Berlin Heidelberg 2002.
CITATION STYLE
Roth, V., Braun, M. L., Lange, T., & Buhmann, J. M. (2002). Stability-based model order selection in clustering with applications to gene expression data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2415, 607–612. https://doi.org/10.1007/3-540-46084-5_99
Mendeley helps you to discover research relevant for your work.