In this paper, we propose a framework that maps categorical data into a numerical data space via a reference set, aiming to make the existing numerical clustering algorithms directly applicable on the generated image data set as well as to visualize the data. Using statistics theories, we analyze our framework and give the conditions under which the data mapping is efficient and yet preserves a flexible property of the original data, i.e. the data points within the same cluster are more similar. The algorithm is simple and has good effectiveness under some conditions. The experimental evaluation on numerous categorical data sets shows that it not only outperforms the related data mapping approaches but also beats some categorical clustering algorithms in terms of effectiveness and efficiency. © 2008 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Shen, Z. Y., Sun, J., Shen, Y. D., & Li, M. (2008). R-map: Mapping categorical data for clustering and visualization based on reference sets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5012 LNAI, pp. 992–998). https://doi.org/10.1007/978-3-540-68125-0_104
Mendeley helps you to discover research relevant for your work.