Data characterization for effective prototype selection

Ramón A. Mollineda; J. Salvador Sánchez; José M. Sotoca

Conference Proceedings

Data characterization for effective prototype selection

Lecture Notes in Computer Science (2005) 3523(II) 27-34

DOI: 10.1007/11492542_4

56Citations

24Readers

Get full text

Abstract

The Nearest Neighbor classifier is one of the most popular supervised classification methods. It is very simple, intuitive and accurate in a great variety of real-world applications. Despite its simplicity and effectiveness, practical use of this rule has been historically limited due to its high storage requirements and the computational costs involved, as well as the presence of outliers. In order to overcome these drawbacks, it is possible to employ a suitable prototype selection scheme, as a way of storage and computing time reduction and it usually provides some increase in classification accuracy. Nevertheless, in some practical cases prototype selection may even produce a degradation of the classifier effectiveness. From an empirical point of view, it is still difficult to know a priori when this method will provide an appropriate behavior. The present paper tries to predict how appropriate a prototype selection algorithm will result when applied to a particular problem, by characterizing data with a set of complexity measures. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Mollineda, R. A., Sánchez, J. S., & Sotoca, J. M. (2005). Data characterization for effective prototype selection. In Lecture Notes in Computer Science (Vol. 3523, pp. 27–34). Springer Verlag. https://doi.org/10.1007/11492542_4

Data characterization for effective prototype selection

Abstract

Cite

Register to see more suggestions