Unsupervised joint feature discretization and selection

Artur Ferreira; Mário Figueiredo

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011), Vol. 6669 LNCS, pp. 200–207

DOI: 10.1007/978-3-642-21257-4_25

Abstract

Many applications involve high-dimensional datasets with different types of data. For instance, text classification and information retrieval problems involve large collections of documents. Each text is usually represented by a bag-of-words or similar representation, with a large number of features (terms). Many of these features may be irrelevant (or even detrimental) for the learning tasks. This excessive number of features also increases the memory needed to represent and process these collections, which shows the need for adequate techniques for feature representation, reduction, and selection, both to improve classification accuracy and to reduce memory requirements. In this paper, we propose a combined unsupervised feature discretization and feature selection technique. Experimental results on standard datasets show the efficiency of the proposed techniques as well as an improvement over previous similar techniques. © 2011 Springer-Verlag.

Cite

Ferreira, A., & Figueiredo, M. (2011). Unsupervised joint feature discretization and selection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6669 LNCS, pp. 200–207). https://doi.org/10.1007/978-3-642-21257-4_25
