Using Clustering for Supervised Feature Selection to Detect Relevant Features

Christoph Lohrmann; Pasi Luukka

Conference Proceedings

Using Clustering for Supervised Feature Selection to Detect Relevant Features

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11943 LNCS 272-283

DOI: 10.1007/978-3-030-37599-7_23

2Citations

11Readers

Get full text

Abstract

In many applications in machine learning, large quantities of features and information are available, but these can be of low quality. A novel filter method for feature selection for classification termed COLD is presented that uses class-wise clustering to reduce the dimensionality of the data. The idea behind this approach is that if a relevant feature would be removed from the set of features, the separation of clusters belonging to different classes will deteriorate. Four artificial examples and two real-world data sets are presented on which COLD is compared with several popular filter methods. For the artificial examples, only COLD is capable to consistently rank the features according to their contribution to the separation of the classes. For the real-world Dermatology and Arrhythmia dataset, COLD demonstrates the ability to remove a large number of features and improve the classification accuracy or, at a minimum, not degrade the performance considerably.

Author supplied keywords

Cite

CITATION STYLE

APA

Lohrmann, C., & Luukka, P. (2019). Using Clustering for Supervised Feature Selection to Detect Relevant Features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11943 LNCS, pp. 272–283). Springer. https://doi.org/10.1007/978-3-030-37599-7_23

Using Clustering for Supervised Feature Selection to Detect Relevant Features

Abstract

Author supplied keywords

Cite

Register to see more suggestions