Abstract
Mean shift clustering finds the modes of the data probability density by identifying the zero points of the density gradient. Since it does not require to fix the number of clusters in advance, the mean shift has been a popular clustering algorithm in various application fields. A typical implementation of the mean shift is to first estimate the density by kernel density estimation and then compute its gradient. However, since a good density estimation does not necessarily imply an accurate estimation of the density gradient, such an indirect two-step approach is not reliable. In this paper, we propose a method to directly estimate the gradient of the log-density without going through density estimation. The proposed method gives the global solution analytically and thus is computationally efficient. We then develop a mean-shift-like fixed-point algorithm to find the modes of the density for clustering. As in the mean shift, one does not need to set the number of clusters in advance. We experimentally show that the proposed clustering method significantly outperforms the mean shift especially for high-dimensional data. © 2014 Springer-Verlag.
Author supplied keywords
Cite
CITATION STYLE
Sasaki, H., Hyvärinen, A., & Sugiyama, M. (2014). Clustering via mode seeking by direct estimation of the gradient of a log-density. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8726 LNAI, pp. 19–34). Springer Verlag. https://doi.org/10.1007/978-3-662-44845-8_2
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.