Multiple concept learning - A novel approach to feature selection in text categorization

Son Doan; Susumu Horiguchi

Conference Proceedings

Multiple concept learning - A novel approach to feature selection in text categorization

Advances in Soft Computing (2005) (AISC) 1043-1052

DOI: 10.1007/3-540-32391-0_107

0Citations

3Readers

Get full text

Abstract

Concept learning is an interesting problem in machine learning and has many applications in real-world problems. This paper considers the multiple concept learning which are extended from binary concept learning. Our main contribution in this paper is to propose a new framework for multiple concept learning. To this end, two sparseness and semantic measures are proposed in order to characterize the scatter and the concentration of concepts in a system. Using both of these two measures, a general strategy of multiple concept learning is given and applied to feature selection in text categorization problem. The experimental results implemented to two benchmark datasets 20Newsgroups and Reuters-21578 show that our approach improve the performances using the Rochio and naive Bayes algorithms compared to conventional methods in the system.

Author supplied keywords

Cite

CITATION STYLE

APA

Doan, S., & Horiguchi, S. (2005). Multiple concept learning - A novel approach to feature selection in text categorization. In Advances in Soft Computing (pp. 1043–1052). Springer Verlag. https://doi.org/10.1007/3-540-32391-0_107

Multiple concept learning - A novel approach to feature selection in text categorization

Abstract

Author supplied keywords

Cite

Register to see more suggestions