A feature selection for text categorization on research support system Papits

Abstract

We have developed Papits, a research support system that shares research information, such as PDF files of research papers, among computers on a network and classifies that information by research field. Papits users can share a variety of research information and survey the corpora of their particular fields of research. To realize Papits, we need a mechanism for identifying which words are best suited to classifying documents into predefined classes. We must also handle cases where documents are classified into multivalued fields and where the data available for classification is insufficient. In this paper, we present a method for implementing automatic classification in Papits based on a text classification technique. We also propose a new feature selection method for classifying documents, represented as a bag-of-words, into a multivalued category. Our method transforms the multivalued category into a binary category so that the words characteristic of a category can be identified easily from only a small amount of training data. Our experimental results indicate that our method can effectively classify documents in Papits. © Springer-Verlag Berlin Heidelberg 2004.
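
The idea of turning a multivalued category into a binary one before selecting features can be illustrated with a small sketch. The abstract does not say which scoring function the authors use, so the example below assumes information gain over word presence as a stand-in. The function names, the toy documents, and the field labels are all hypothetical and are not taken from Papits.

```python
import math

def binarize(labels, target):
    """Collapse multivalued labels into target-vs-rest booleans."""
    return [label == target for label in labels]

def entropy(pos, total):
    """Binary entropy (in bits) of a group with `pos` positives out of `total`."""
    if total == 0 or pos == 0 or pos == total:
        return 0.0
    p = pos / total
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))

def information_gain(docs, is_target, word):
    """Entropy reduction from splitting documents on presence of `word`.

    Assumption: information gain is used here only for illustration;
    the source does not name the scoring function.
    """
    n = len(docs)
    with_word = [t for d, t in zip(docs, is_target) if word in d]
    without = [t for d, t in zip(docs, is_target) if word not in d]
    h_before = entropy(sum(is_target), n)
    h_after = (len(with_word) / n) * entropy(sum(with_word), len(with_word)) \
        + (len(without) / n) * entropy(sum(without), len(without))
    return h_before - h_after

def top_words(docs, labels, target, k=3):
    """Rank vocabulary words for one field treated as a binary category."""
    is_target = binarize(labels, target)
    vocab = sorted(set().union(*docs))
    scored = [(information_gain(docs, is_target, w), w) for w in vocab]
    # Highest gain first; ties are broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [w for _, w in scored[:k]]

# Hypothetical toy corpus: each document is a set of words (bag-of-words
# reduced to presence), each label a made-up research field.
docs = [
    {"agent", "negotiation", "auction"},
    {"agent", "protocol", "auction"},
    {"kernel", "classifier", "feature"},
    {"feature", "classifier", "corpus"},
    {"parser", "grammar", "corpus"},
    {"parser", "grammar", "agent"},
]
labels = ["agents", "agents", "learning", "learning", "nlp", "nlp"]

print(top_words(docs, labels, "agents", k=1))
print(top_words(docs, labels, "nlp", k=2))
```

Scoring each field against all the others means every word is judged by how well it separates one field from the rest, which is one plausible reading of why a binary view helps when only a few training documents per field are available.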

Cite

Ozono, T., Shintani, T., Ito, T., & Hasegawa, T. (2004). A feature selection for text categorization on research support system Papits. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3157, pp. 524–533). Springer Verlag. https://doi.org/10.1007/978-3-540-28633-2_56
