A feature selection for text categorization on research support system Papits

Tadachika Ozono; Toramatsu Shintani; Takayuki Ito; Tomoharu Hasegawa

Conference Proceedings

A feature selection for text categorization on research support system Papits

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2004) 3157 524-533

DOI: 10.1007/978-3-540-28633-2_56

3Citations

5Readers

Get full text

Abstract

We have developed a research support system, called Papits, that shares research information, such as PDF files of research papers, in computers on the network and classifies the information into types of research fields. Users of Papits can share various research information and survey the corpora of their particular fields of research. In order to realize Papits, we need to design a mechanism for identifying what words are best suited to classify documents in predefined classes. Further we have to consider classification in cases where we must classify documents into multivalued fields and where there is insufficient data for classification. In this paper, we present an implementation method of automatic classification based on a text classification technique for Papits. We also propose a new method for using feature selection to classify documents that are represented by a bag-of-words into a multivalued category. Our method transforms the multivalued category into a binary category to easily identify the characteristic words to classify category in a few training data. Our experimental result indicates that our method can effectively classify documents in Papits. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Ozono, T., Shintani, T., Ito, T., & Hasegawa, T. (2004). A feature selection for text categorization on research support system Papits. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3157, pp. 524–533). Springer Verlag. https://doi.org/10.1007/978-3-540-28633-2_56

A feature selection for text categorization on research support system Papits

Abstract

Cite

Register to see more suggestions