Support vector machines training data selection using a genetic algorithm

Michal Kawulok; Jakub Nalepa

Conference ProceedingsOPEN ACCESS

Support vector machines training data selection using a genetic algorithm

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7626 LNCS 557-565

DOI: 10.1007/978-3-642-34166-3_61

50Citations

12Readers

Abstract

This paper presents a new method for selecting valuable training data for support vector machines (SVM) from large, noisy sets using a genetic algorithm (GA). SVM training data selection is a known, however not extensively investigated problem. The existing methods rely mainly on analyzing the geometric properties of the data or adapt a randomized selection, and to the best of our knowledge, GA-based approaches have not been applied for this purpose yet. Our work was inspired by the problems encountered when using SVM for skin segmentation. Due to a very large set size, the existing methods are too time-consuming, and random selection is not effective because of the set noisiness. In the work reported here we demonstrate how a GA can be used to optimize the training set, and we present extensive experimental results which confirm that the new method is highly effective for real-world data. © 2012 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Kawulok, M., & Nalepa, J. (2012). Support vector machines training data selection using a genetic algorithm. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7626 LNCS, pp. 557–565). https://doi.org/10.1007/978-3-642-34166-3_61

Support vector machines training data selection using a genetic algorithm

Abstract

Cite

Register to see more suggestions