Chi-square classifier for document categorization

8Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply the well-known statistical hypothesis test that considers images of documents and domains as normalized vectors. In comparison with existing methods, such approach allows to take into account a random character of initial data. The classifier is developed in the framework of Document Investigator software package.

Cite

CITATION STYLE

APA

Alexandrov, M., Gelbukh, A., & Lozovoi, G. (2001). Chi-square classifier for document categorization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2004, pp. 457–459). Springer Verlag. https://doi.org/10.1007/3-540-44686-9_45

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free