Similarity-based data reduction and classification

Gongde Guo; Hui Wang; David Bell; Zhining Liao

Conference Proceedings

Similarity-based data reduction and classification

Advances in Soft Computing (2005) 28 227-238

DOI: 10.1007/3-540-32370-8_16

2Citations

1Readers

Get full text

Abstract

The κ-Nearest-Neighbors (κNN) is a simple but effective method for classification. The major drawbacks with respect to κNN are (1) low efficiency and (2) dependence on the parameter κ. In this paper, we propose a novel similarity-based data reduction method and several variations aimed at overcoming these shortcomings. Our method constructs a similarity-based model for the data, which replaces the data to serve as the basis of classification. The value of κ is automatically determined, is varied in terms of local data distribution, and is optimal in terms of classification accuracy. The construction of the model significantly reduces the number of data for learning, thus making classification faster. Experiments conducted on some public data sets show that the proposed methods compare well with other data reduction methods in both efficiency and effectiveness.© Springer-Verlag Berlin Heidelberg 2005.

Author supplied keywords

Cite

CITATION STYLE

APA

Guo, G., Wang, H., Bell, D., & Liao, Z. (2005). Similarity-based data reduction and classification. In Advances in Soft Computing (Vol. 28, pp. 227–238). Springer Verlag. https://doi.org/10.1007/3-540-32370-8_16

Similarity-based data reduction and classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions