C-Rank: A Concept Linking Approach to Unsupervised Keyphrase Extraction

4Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Keyphrase extraction is the task of identifying a set of phrases that best represent a natural language document. It is a fundamental and challenging task that assists publishers to index and recommend relevant documents to readers. In this article, we introduce C-Rank, a novel unsupervised approach to automatically extract keyphrases from single documents by using concept linking. Our method explores Babelfy to identify candidate keyphrases, which are weighted based on heuristics and their centrality inside a co-occurrence graph where keyphrases appear as vertices. It improves the results obtained by graph-based techniques without training nor background data inserted by users. Evaluations are performed on SemEval and INSPEC datasets, producing competitive results with state-of-the-art tools. Furthermore, C-Rank generates intermediate structures with semantically annotated data that can be used to analyze larger textual compendiums, which might improve domain understatement and enrich textual representation methods.

Cite

CITATION STYLE

APA

Tosi, M. D. L., & dos Reis, J. C. (2019). C-Rank: A Concept Linking Approach to Unsupervised Keyphrase Extraction. In Communications in Computer and Information Science (Vol. 1057 CCIS, pp. 236–247). Springer. https://doi.org/10.1007/978-3-030-36599-8_21

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free