HyperDoc2vec: Distributed representations of hypertext documents

30Citations
Citations of this article
154Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Hypertext documents, such as web pages and academic papers, are of great importance in delivering information in our daily life. Although being effective on plain documents, conventional text embedding methods suffer from information loss if directly adapted to hyper-documents. In this paper, we propose a general embedding approach for hyper-documents, namely, hyperdoc2vec, along with four criteria characterizing necessary information that hyper-document embedding models should preserve. Systematic comparisons are conducted between hyperdoc2vec and several competitors on two tasks, i.e., paper classification and citation recommendation, in the academic paper domain. Analyses and experiments both validate the superiority of hyperdoc2vec to other models w.r.t. the four criteria.

Cite

CITATION STYLE

APA

Han, J., Song, Y., Zhao, W. X., Shi, S., & Zhang, H. (2018). HyperDoc2vec: Distributed representations of hypertext documents. In ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers) (Vol. 1, pp. 2384–2394). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p18-1222

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free