A new k-means-based algorithm for automatic clustering and outlier discovery

Trushali Jambudi; Savita Gandhi

Conference Proceedings

A new k-means-based algorithm for automatic clustering and outlier discovery

Smart Innovation, Systems and Technologies (2019) 107 457-467

DOI: 10.1007/978-981-13-1747-7_44

4Citations

5Readers

Get full text

Abstract

K-means is one of the most popular partition-based clustering algorithms that partition data objects based on attributes/features into K number of groups or clusters. In this paper, we address the major issues affecting the performance of k-means clustering algorithm. We have proposed as well as implemented a new k-means-based clustering algorithm which forms clusters by detecting and removing both global and local outliers and automatically converging into optimal clusters which are formed by a two-part process of splitting the initial clusters into subclusters based on criterion at local level and, in the second part, merging the clusters that satisfy the nearness criterion. Experiments show that our algorithm is able to automatically generate optimal number of clusters of different sizes and shapes which are free from global and local outliers.

Author supplied keywords

Cite

CITATION STYLE

APA

Jambudi, T., & Gandhi, S. (2019). A new k-means-based algorithm for automatic clustering and outlier discovery. In Smart Innovation, Systems and Technologies (Vol. 107, pp. 457–467). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-13-1747-7_44

A new k-means-based algorithm for automatic clustering and outlier discovery

Abstract

Author supplied keywords

Cite

Register to see more suggestions