Extended Bi-gram features in text categorization

1Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Usually, in traditional text categorization systems based on Vector Space Model, there is no context information in a feature vector, which limited the performance of the system. To make use of more information, it is natural to select bi-gram feature in addition to unigram feature. However, the longer the feature is, the more important the feature selection algorithm is to get good balance in feature space This paper proposed two feature extraction methods which can get better feature balance for document categorization. Experiments show that our extended bi-gram feature improved system performance greatly. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Zhang, X., & Zhu, X. (2005). Extended Bi-gram features in text categorization. In Lecture Notes in Computer Science (Vol. 3523, pp. 379–386). Springer Verlag. https://doi.org/10.1007/11492542_47

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free