Most existing text classification methods (and text mining methods at large) are based on representing the documents using the traditional vector space model. We argue that important information, such as the relationship among words, is lost. We propose a term graph model to represent not only the content of a document but also the relationship among the keywords. We demonstrate that the new model enables us to define new similarity functions, such as considering rank correlation based on PageRank-style algorithms, for the classification purpose. Our preliminary results show promising results of our new model. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Wang, W., Do, D. B., & Lin, X. (2005). Term graph model for text classification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3584 LNAI, pp. 19–30). Springer Verlag. https://doi.org/10.1007/11527503_5
Mendeley helps you to discover research relevant for your work.