A new approach for calculating similarity of categorical data

Cheng Hao Jin; Xun Li; Yang Koo Lee; Gouchol Pok; Keun Ho Ryu

Conference Proceedings

Communications in Computer and Information Science (2011) 206 CCIS 584-590

DOI: 10.1007/978-3-642-24106-2_74

Abstract

Similarity measures are very important in data mining techniques such as clustering, nearest-neighbor classification, and outlier detection [1][4]. Many similarity measures have been proposed. For numeric data, there are many Minkowski distance-based similarity measures. Although similarity measures for categorical data have been studied for a long time, they still have many open issues. The main issue is understanding the relationship between categorical attribute values. For categorical data, the notion of similarity is not as clear as it is for numeric data. In this paper, we propose a new approach to understanding the relationship between categorical data. The approach uses an artificial neural network to extract significant features for computing the distance between two categorical data objects. © 2011 Springer-Verlag.
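
To illustrate the contrast the abstract draws, here is a minimal sketch of a Minkowski distance for numeric data next to a simple overlap (matching) measure for categorical data. This is not the neural-network approach proposed in the paper, whose details the abstract does not give; the function names and the example attribute values are assumptions chosen for illustration only.

```python
def minkowski_distance(x, y, p=2):
    """Minkowski distance of order p between two numeric vectors.

    p=1 gives the Manhattan distance, p=2 the Euclidean distance.
    """
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)


def overlap_similarity(x, y):
    """Fraction of attributes on which two categorical objects agree.

    Illustrative baseline only; it is not the measure proposed in the paper.
    """
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("objects must have the same, non-zero number of attributes")
    return sum(a == b for a, b in zip(x, y)) / len(x)


# Numeric data: the distance reflects how far apart the values are.
d_euclidean = minkowski_distance([0, 0], [3, 4], p=2)
d_manhattan = minkowski_distance([0, 0], [3, 4], p=1)

# Categorical data (hypothetical attributes: color, size).
# Overlap only sees "equal" or "not equal", so "red" vs "orange" and
# "red" vs "blue" receive the same score.
s_orange = overlap_similarity(("red", "small"), ("orange", "small"))
s_blue = overlap_similarity(("red", "small"), ("blue", "small"))
```

In this sketch both categorical comparisons score the same, because a pure matching measure carries no information about how attribute values relate to each other. That gap is the issue the abstract identifies as the main one for categorical data.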

Cite

Jin, C. H., Li, X., Lee, Y. K., Pok, G., & Ryu, K. H. (2011). A new approach for calculating similarity of categorical data. In Communications in Computer and Information Science (Vol. 206 CCIS, pp. 584–590). https://doi.org/10.1007/978-3-642-24106-2_74
