Feature extraction using single variable classifiers for binary text classification

2Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The most popular approach for document representation is the bag-of-words where terms are considered as features. In order to compute the values of these features, the term frequencies are generally scaled by a collection frequency factor to take into account the relative importance of different terms. The term frequencies can be considered as raw data about the input document. In this study, a novel framework for feature extraction is proposed for binary text classification where feature extraction is defined as a single variable classification problem. The term frequencies are the inputs and the output of each classifier is used to define a triple of features for the corresponding term. The magnitude of the classifier output that is in the interval [0.5,1] is an indicator for the confidence of the classifier and it is also employed in document representation together with the term frequency and the collection frequency factor. © 2013 Springer-Verlag.

Cite

CITATION STYLE

APA

Altinçay, H. (2013). Feature extraction using single variable classifiers for binary text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7906 LNAI, pp. 332–340). https://doi.org/10.1007/978-3-642-38577-3_34

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free