Paper2vec: Combining graph and text information for scientific paper representation

Soumyajit Ganguly; Vikram Pudi

Conference Proceedings

Paper2vec: Combining graph and text information for scientific paper representation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10193 LNCS 383-395

DOI: 10.1007/978-3-319-56608-5_30

59Citations

67Readers

Get full text

Abstract

We present Paper2vec, a novel neural network embedding based approach for creating scientific paper representations which make use of both textual and graph-based information. An academic citation network can be viewed as a graph where individual nodes contain rich textual information. With the current trend of open-access to most scientific literature, we presume that this full text of a scientific article contain vital source of information which aids in various recommendation and prediction tasks concerning this domain. To this end, we propose an approach, Paper2vec, which comprises of information from both the modalities and results in a rich representation for scientific papers. Over the recent past representation learning techniques have been studied extensively using neural networks. However, they are modeled independently for text and graph data. Paper2vec leverages recent research in the broader field of unsupervised feature learning from both graphs and text documents. We demonstrate the efficacy of our representations on three real world academic datasets in two tasks - node classification and link prediction where Paper2vec is able to outperform state-of-the-art by a considerable margin.

Author supplied keywords

Cite

CITATION STYLE

APA

Ganguly, S., & Pudi, V. (2017). Paper2vec: Combining graph and text information for scientific paper representation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10193 LNCS, pp. 383–395). Springer Verlag. https://doi.org/10.1007/978-3-319-56608-5_30

Paper2vec: Combining graph and text information for scientific paper representation

Abstract

Author supplied keywords

Cite

Register to see more suggestions