Does feature selection improve classification? A large scale experiment in OpenML

Martijn J. Post; Peter Van Der Putten; Jan N. Van Rijn

Conference Proceedings

Does feature selection improve classification? A large scale experiment in OpenML

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9897 LNCS 158-170

DOI: 10.1007/978-3-319-46349-0_14

17Citations

30Readers

Get full text

Abstract

It is often claimed that data pre-processing is an important factor contributing towards the performance of classification algorithms. In this paper we investigate feature selection, a common data preprocessing technique. We conduct a large scale experiment and present results on what algorithms and data sets benefit from this technique. Using meta-learning we can find out for which combinations this is the case. To complement a large set of meta-features, we introduce the Feature Selection Landmarkers, which prove useful for this task. All our experimental results are made publicly available on OpenML.

Author supplied keywords

Cite

CITATION STYLE

APA

Post, M. J., Van Der Putten, P., & Van Rijn, J. N. (2016). Does feature selection improve classification? A large scale experiment in OpenML. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9897 LNCS, pp. 158–170). Springer Verlag. https://doi.org/10.1007/978-3-319-46349-0_14

Does feature selection improve classification? A large scale experiment in OpenML

Abstract

Author supplied keywords

Cite

Register to see more suggestions