On the relation between kNN accuracy and dataset compression level

Marcin Blachnik

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9692 541-551

DOI: 10.1007/978-3-319-39378-0_46

Abstract

The paper discusses how instance selection can be used to assess kNN performance. There is a strong correlation between the compression level of a dataset obtained by instance selection methods and the prediction accuracy of the kNN classifier trained on the full training dataset. Using two standard instance selection algorithms, CNN and ENN, which belong to two different groups of methods (so-called condensation and editing methods, respectively), we perform an empirical analysis to verify this relation. The results show that the relation is almost linear, so the level of compression is linearly correlated with the accuracy. In other words, knowing the compression achieved by instance selection methods allows us to estimate the accuracy of the final kNN prediction model.
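
As a rough illustration of the quantities the abstract relates, the Python sketch below computes the compression reached by simplified versions of CNN (condensation) and ENN (editing) next to a leave-one-out kNN accuracy estimate. It is not the paper's experimental setup: the function names, the synthetic two-cluster data, the leave-one-out estimate, and the definition of compression as the fraction of training instances removed are all assumptions made for this sketch.

```python
import random
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def knn_predict(X, y, query, k=1, skip=None):
    """Majority vote among the k nearest neighbours of `query`; index `skip` is ignored."""
    idx = [i for i in range(len(X)) if i != skip]
    idx.sort(key=lambda i: sq_dist(X[i], query))
    return Counter(y[i] for i in idx[:k]).most_common(1)[0][0]


def enn_select(X, y, k=3):
    """Editing (ENN): keep instances whose label matches the vote of their k neighbours."""
    return [i for i in range(len(X)) if knn_predict(X, y, X[i], k, skip=i) == y[i]]


def cnn_select(X, y, seed=0):
    """Condensation (CNN): grow a subset until 1-NN on it classifies all instances correctly."""
    order = list(range(len(X)))
    random.Random(seed).shuffle(order)
    keep = [order[0]]
    changed = True
    while changed:
        changed = False
        for i in order:
            if i in keep:
                continue
            Xs = [X[j] for j in keep]
            ys = [y[j] for j in keep]
            if knn_predict(Xs, ys, X[i], 1) != y[i]:
                keep.append(i)  # misclassified by the current subset, so store it
                changed = True
    return keep


def compression(n_selected, n_total):
    """Assumed definition: fraction of training instances removed by selection."""
    return 1.0 - n_selected / n_total


def loo_accuracy(X, y, k=1):
    """Leave-one-out accuracy of kNN on the full dataset."""
    hits = sum(knn_predict(X, y, X[i], k, skip=i) == y[i] for i in range(len(X)))
    return hits / len(X)


def make_blobs(n_per_class, spread, seed=0):
    """Hypothetical toy data: two Gaussian clusters whose overlap grows with `spread`."""
    rng = random.Random(seed)
    X, y = [], []
    for label, (cx, cy) in enumerate([(0.0, 0.0), (3.0, 3.0)]):
        for _ in range(n_per_class):
            X.append((rng.gauss(cx, spread), rng.gauss(cy, spread)))
            y.append(label)
    return X, y


if __name__ == "__main__":
    for spread in (0.5, 1.0, 2.0):
        X, y = make_blobs(60, spread)
        n = len(X)
        print(
            f"spread={spread}",
            f"acc={loo_accuracy(X, y):.3f}",
            f"cnn_compression={compression(len(cnn_select(X, y)), n):.3f}",
            f"enn_compression={compression(len(enn_select(X, y)), n):.3f}",
        )
```

Running the script prints accuracy and both compression values for three noise levels, so the three columns can be compared side by side. Any trend seen on this toy data only illustrates the idea and should not be read as a reproduction of the paper's results.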

Citation (APA)

Blachnik, M. (2016). On the relation between kNN accuracy and dataset compression level. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9692, pp. 541–551). Springer Verlag. https://doi.org/10.1007/978-3-319-39378-0_46
