Feature extraction using single variable classifiers for binary text classification

Hakan Altinçay

Conference Proceedings

Feature extraction using single variable classifiers for binary text classification

Altinçay H

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7906 LNAI 332-340

DOI: 10.1007/978-3-642-38577-3_34

2Citations

4Readers

Get full text

Abstract

The most popular approach for document representation is the bag-of-words where terms are considered as features. In order to compute the values of these features, the term frequencies are generally scaled by a collection frequency factor to take into account the relative importance of different terms. The term frequencies can be considered as raw data about the input document. In this study, a novel framework for feature extraction is proposed for binary text classification where feature extraction is defined as a single variable classification problem. The term frequencies are the inputs and the output of each classifier is used to define a triple of features for the corresponding term. The magnitude of the classifier output that is in the interval [0.5,1] is an indicator for the confidence of the classifier and it is also employed in document representation together with the term frequency and the collection frequency factor. © 2013 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Altinçay, H. (2013). Feature extraction using single variable classifiers for binary text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7906 LNAI, pp. 332–340). https://doi.org/10.1007/978-3-642-38577-3_34

Feature extraction using single variable classifiers for binary text classification

Abstract

Author supplied keywords

Cite

Register to see more suggestions