Text categorization as a graph classification problem

François Rousseau; Emmanouil Kiagias; Michalis Vazirgiannis

Conference Proceedings, Open Access

ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (2015), Vol. 1, pp. 1702-1712

DOI: 10.3115/v1/p15-1164

127 Citations

200 Readers

Abstract

In this paper, we consider the task of text categorization as a graph classification problem. By representing textual documents as graph-of-words instead of historical n-gram bag-of-words, we extract more discriminative features that correspond to long-distance n-grams through frequent subgraph mining. Moreover, by capitalizing on the concept of k-core, we reduce the graph representation to its densest part - its main core - speeding up the feature extraction step for little to no cost in prediction performance. Experiments on four standard text classification datasets show statistically significantly higher accuracy and macro-averaged F1-score compared to baseline approaches.
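
The two graph steps named in the abstract, building a graph-of-words and reducing it to its main core, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `graph_of_words` and `main_core`, the sliding-window co-occurrence rule, the default window size, and the choice of an undirected, unweighted graph are all assumptions made for illustration, and the frequent subgraph mining step is not covered.

```python
from collections import defaultdict


def graph_of_words(tokens, window=4):
    """Build an undirected, unweighted co-occurrence graph (assumed variant).

    Two distinct terms are linked when they occur within `window`
    consecutive tokens. Returns a dict {term: set(neighbours)}.
    """
    adj = defaultdict(set)
    for i, term in enumerate(tokens):
        adj[term]  # register the term even if it ends up isolated
        for other in tokens[i + 1 : i + window]:
            if other != term:
                adj[term].add(other)
                adj[other].add(term)
    return dict(adj)


def main_core(adj):
    """Return (k, nodes) for the non-empty k-core with the largest k.

    A k-core is the maximal subgraph in which every node has degree >= k;
    it is found here by repeatedly peeling low-degree nodes.
    """
    remaining = {u: set(vs) for u, vs in adj.items()}  # work on a copy
    best_k, best_nodes = 0, set(remaining)
    k = 1
    while remaining:
        low = [u for u, vs in remaining.items() if len(vs) < k]
        while low:
            for u in low:
                for v in remaining.pop(u):
                    if v in remaining:
                        remaining[v].discard(u)
            low = [u for u, vs in remaining.items() if len(vs) < k]
        if remaining:
            best_k, best_nodes = k, set(remaining)
        k += 1
    return best_k, best_nodes


# Toy document: with window=2 only adjacent terms are linked.
g = graph_of_words("a b c a d".split(), window=2)
k, nodes = main_core(g)
print(k, sorted(nodes))  # 2 ['a', 'b', 'c']
```

In this toy example the term `d` has a single neighbour and is peeled away, leaving the triangle `a`, `b`, `c` as the main core; features would then be extracted from this smaller graph only.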

Citation (APA)

Rousseau, F., Kiagias, E., & Vazirgiannis, M. (2015). Text categorization as a graph classification problem. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (Vol. 1, pp. 1702–1712). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p15-1164
