Parallel fuzzy c-means clustering for large data sets

Terence Kwok; Kate Smith; Sebastian Lozano; David Taniar

Conference ProceedingsOPEN ACCESS

Parallel fuzzy c-means clustering for large data sets

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2400 365-374

DOI: 10.1007/3-540-45706-2_48

94Citations

48Readers

Abstract

The parallel fuzzy c-means (PFCM) algorithm for clustering large data sets is proposed in this paper. The proposed algorithm is designed to run on parallel computers of the Single Program Multiple Data (SPMD) model type with the Message Passing Interface (MPI). A comparison is made between PFCM and an existing parallel k-means (PKM) algorithm in terms of their parallelisation capability and scalability. In an implementation of PFCM to cluster a large data set from an insurance company, the proposed algorithm is demonstrated to have almost ideal speedups as well as an excellent scaleup with respect to the size of the data sets.

Cite

CITATION STYLE

APA

Kwok, T., Smith, K., Lozano, S., & Taniar, D. (2002). Parallel fuzzy c-means clustering for large data sets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2400, pp. 365–374). Springer Verlag. https://doi.org/10.1007/3-540-45706-2_48

Parallel fuzzy c-means clustering for large data sets

Abstract

Cite

Register to see more suggestions