Local latent semantic analysis based on support vector machine for imbalanced text categorization

1Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Many text categorization tasks involve imbalanced training examples. We tackle this problem by using improved local Latent Semantic Analysis. LSA has been shown to be extremely useful but it is not an optimal representation for text categorization because this unsupervised method ignores class discrimination while only concentrating on representation. Some local LSI methods have been proposed to improve the classification by utilizing class discrimination information. In this paper, we choose support vector machine (SVM) to generate imbalanced dataset as the local regions for local LSA. Experimental results show that our method is better than global LSA and traditional local LSA methods on classification within a much smaller LSA dimension. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Wan, Y., Tong, H., & Deng, Y. (2011). Local latent semantic analysis based on support vector machine for imbalanced text categorization. In Communications in Computer and Information Science (Vol. 226 CCIS, pp. 321–329). https://doi.org/10.1007/978-3-642-23235-0_42

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free