Hierarchical clustering via penalty-based aggregation and the Genie approach

5Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The paper discusses a generalization of the nearest centroid hierarchical clustering algorithm. A first extension deals with the incorporation of generic distance-based penalty minimizers instead of the classical aggregation by means of centroids. Due to that the presented algorithm can be applied in spaces equipped with an arbitrary dissimilarity measure (images, DNA sequences, etc.). Secondly, a correction preventing the formation of clusters of too highly unbalanced sizes is applied: just like in the recently introduced Genie approach, which extends the single linkage scheme, the new method averts a chosen inequity measure (e.g., the Gini-, de Vergottini-, or Bonferroni-index) of cluster sizes from raising above a predefined threshold. Numerous benchmarks indicate that the introduction of such a correction increases the quality of the resulting clusterings significantly.

Cite

CITATION STYLE

APA

Gagolewski, M., Cena, A., & Bartoszuk, M. (2016). Hierarchical clustering via penalty-based aggregation and the Genie approach. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9880 LNAI, pp. 191–202). Springer Verlag. https://doi.org/10.1007/978-3-319-45656-0_16

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free