CluSim: a python package for calculating clustering similarity

Alexander Gates; Yong-Yeol Ahn

Journal ArticleOPEN ACCESS

CluSim: a python package for calculating clustering similarity

Gates A
Ahn Y

Journal of Open Source Software (2019) 4(35) 1264

DOI: 10.21105/joss.01264

N/ACitations

23Readers

Abstract

Clustering is a primary method to reveal the structure of data (Jain, Murty, & Flynn, 1999). To understand, evaluate, and leverage data clusterings, we need to quantitatively compare them. Clustering comparison is the basis for method evaluation, consensus clustering, and tracking the temporal evolution of clusters, among many other tasks. For instance, the evaluation of a clustering method is usually achieved by comparing the method’s result to a planted reference clustering, assuming that the more similar the method’s solution is to the reference clustering, the better the method. Despite the importance of clustering comparison, no consensus has been reached for a standardized assessment; each similarity measure rewards and penalizes different criteria, sometimes producing contradictory conclusions.

Cite

CITATION STYLE

APA

Gates, A., & Ahn, Y.-Y. (2019). CluSim: a python package for calculating clustering similarity. Journal of Open Source Software, 4(35), 1264. https://doi.org/10.21105/joss.01264

CluSim: a python package for calculating clustering similarity

Abstract

Cite

Register to see more suggestions