Fusing Document, Collection and Label Graph-based Representations with Word Embeddings for Text Classification

5Citations
Citations of this article
84Readers
Mendeley users who have this article in their library.

Abstract

Contrary to the traditional Bag-of-Words approach, we consider the Graph-of-Words (GoW) model in which each document is represented by a graph that encodes relationships between the different terms. Based on this formulation, the importance of a term is determined by weighting the corresponding node in the document, collection and label graphs, using node centrality criteria. We also introduce novel graph-based weighting schemes by enriching graphs with word-embedding similarities, in order to reward or penalize semantic relationships. Our methods produce more discriminative feature weights for text categorization, outperforming existing frequency-based criteria. Code and data are available online.

Cite

CITATION STYLE

APA

Skianis, K., Malliaros, F. D., & Vazirgiannis, M. (2018). Fusing Document, Collection and Label Graph-based Representations with Word Embeddings for Text Classification. In NAACL HLT 2018 - Graph-Based Methods for Natural Language Processing, TextGraphs 2018 - Proceedings of the 12th Workshop (pp. 49–58). Association for Computational Linguistics. https://doi.org/10.18653/v1/w18-1707

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free