Similarity measure is very important in data mining techniques such as clustering, nearest-neighbor classification, outlier detection and so on [1][4]. There are many similarity measures have been proposed. For numeric data, there are many Minkowski distance-based similarity measures. However, the similarity measures for categorical data have been studied for a long time, it also has many issues. The main issue is to understand relationship between categorical attribute values. For categorical data, the similarity measure is not clear as well as numeric data. In this paper, we propose a new approach to understand relationship between categorical data. This approach is based on artificial neural network to extract significant features for computing distance between two categorical data objects. © 2011 Springer-Verlag.
CITATION STYLE
Jin, C. H., Li, X., Lee, Y. K., Pok, G., & Ryu, K. H. (2011). A new approach for calculating similarity of categorical data. In Communications in Computer and Information Science (Vol. 206 CCIS, pp. 584–590). https://doi.org/10.1007/978-3-642-24106-2_74
Mendeley helps you to discover research relevant for your work.