Poisson naive Bayes for text classification with feature weighting

17Citations
Citations of this article
105Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we investigate the use of multivariate Poisson model and feature weighting to learn naive Bayes text classifier. Our new naive Bayes text classification model assumes that a document is generated by a multivariate Poisson model while the previous works consider a document as a vector of binary term features based on the presence or absence of each term. We also explore the use of feature weighting for the naive Bayes text classification rather than feature selection, which is a quite costly process when a small number of the new training documents are continuously provided. Experimental results on the two test collections indicate that our new model with the proposed parameter estimation and the feature weighting technique leads to substantial improvements compared to the unigram language model classifiers that are known to outperform the original pure naive Bayes text classifiers.

Cite

CITATION STYLE

APA

Kim, S. B., Seo, H. C., & Rim, H. C. (2003). Poisson naive Bayes for text classification with feature weighting. In Proceedings of the 6th International Workshop on Information Retrieval with Asian Languages, IRAL 2003 (pp. 33–40). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1118935.1118940

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free