A divisive ordering algorithm for mapping categorical data to numeric data

0Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The amount of computing time for K Nearest Neighbor Search is linear to the size of the dataset if the dataset is not indexed. This is not endurable for on-line applications with time constraints when the dataset is large. However, if there are categorical attributes in the dataset, an index cannot be built on the dataset. One possible solution to index such datasets is to convert categorical attributes into numeric attributes. Categories are ordered and then are mapped to numeric values. In this paper, we propose a new heuristic ordering algorithm to compare with two previously proposed algorithms that borrow the idea from minimal spanning trees. The new algorithm divisively builds a binary tree by recursively partitioning the categories. Then, we in-order traverse the tree and get an ordering of the categories. After mapping and indexing, we can efficiently retrieve a small portion of the dataset and perform K nearest neighbor search on the portion at the cost of a little bit of accuracy. Experiments show the divisive ordering algorithm performs better than the other two algorithms. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Kuo, H. C. (2005). A divisive ordering algorithm for mapping categorical data to numeric data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3682 LNAI, pp. 979–985). Springer Verlag. https://doi.org/10.1007/11552451_135

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free