Determination of similarity threshold in clustering problems for large data sets

Guillermo Sánchez-Díaz; José F. Martínez-Trinidad

Journal Article, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2003), vol. 2905, pp. 611–618

DOI: 10.1007/978-3-540-24586-5_75

Abstract

A new automatic method, based on an intra-cluster criterion, is proposed for obtaining a similarity threshold that generates a well-defined clustering (or one close to it) for large data sets. The method uses the connected component criterion, and it neither calculates nor stores the similarity matrix of the objects in main memory. It is framed within the unsupervised Logical Combinatorial Pattern Recognition approach. Experiments with the new method on large data sets are also presented. © Springer-Verlag Berlin Heidelberg 2003.
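As a rough illustration of the connected component criterion mentioned in the abstract, the sketch below groups objects whose pairwise similarity reaches a given threshold, computing each similarity on the fly so that no similarity matrix is ever stored. It is not the authors' threshold-selection procedure, which the abstract does not describe. The names `similarity`, `connected_components`, and `beta`, and the feature-matching similarity itself, are assumptions chosen for illustration.

```python
from itertools import combinations


def similarity(a, b):
    """Hypothetical similarity: fraction of features on which two objects agree."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def connected_components(objects, beta, sim=similarity):
    """Group objects into connected components of the graph that links
    two objects whenever their similarity is at least beta.

    Each similarity is computed and discarded immediately, so memory use
    is linear in the number of objects (no n x n matrix is kept).
    """
    parent = list(range(len(objects)))

    def find(i):
        # Follow parent links to the root, halving the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(objects)), 2):
        if sim(objects[i], objects[j]) >= beta:
            ri, rj = find(i), find(j)
            if ri != rj:
                parent[rj] = ri  # merge the two components

    clusters = {}
    for i in range(len(objects)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy data (made up): four objects described by three categorical features.
data = [("a", "x", "1"), ("a", "x", "2"), ("b", "y", "3"), ("b", "y", "4")]
print(connected_components(data, beta=0.6))
```

Raising `beta` splits the data into more, tighter components, and lowering it merges them, which is why the choice of threshold determines the clustering. This sketch still compares every pair of objects, so it saves memory but not time.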

Cite

Sánchez-Díaz, G., & Martínez-Trinidad, J. F. (2003). Determination of similarity threshold in clustering problems for large data sets. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2905, 611–618. https://doi.org/10.1007/978-3-540-24586-5_75
