C-Rank: A Concept Linking Approach to Unsupervised Keyphrase Extraction

Mauro Dalle Lucca Tosi; Julio Cesar dos Reis

Conference Proceedings

Communications in Computer and Information Science (2019) 1057 CCIS 236-247

DOI: 10.1007/978-3-030-36599-8_21

Abstract

Keyphrase extraction is the task of identifying a set of phrases that best represent a natural language document. It is a fundamental and challenging task that helps publishers index and recommend relevant documents to readers. In this article, we introduce C-Rank, a novel unsupervised approach that uses concept linking to automatically extract keyphrases from single documents. Our method uses Babelfy to identify candidate keyphrases, which are weighted based on heuristics and their centrality inside a co-occurrence graph where keyphrases appear as vertices. It improves the results obtained by graph-based techniques without requiring training or background data from users. Evaluations are performed on the SemEval and INSPEC datasets, producing results competitive with state-of-the-art tools. Furthermore, C-Rank generates intermediate structures with semantically annotated data that can be used to analyze larger textual compendiums, which might improve domain understanding and enrich textual representation methods.
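
The graph-based weighting mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the heuristics or the centrality measure, so the sketch assumes sentence-level co-occurrence and weighted degree centrality, and it replaces Babelfy concept linking with plain substring matching over a hand-supplied candidate list. The function name `rank_candidates` and the sample data are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def rank_candidates(sentences, candidates):
    """Rank candidates by weighted degree in a sentence-level co-occurrence graph.

    Assumption: candidates are matched by substring, standing in for the
    concept-linking step that the paper performs with Babelfy.
    """
    # Edge weight = number of sentences in which both candidates appear.
    weights = defaultdict(int)
    for sentence in sentences:
        text = sentence.lower()
        present = sorted(c for c in candidates if c.lower() in text)
        for a, b in combinations(present, 2):
            weights[(a, b)] += 1

    # Weighted degree centrality: sum of the weights of incident edges.
    # Assumption: the paper's actual centrality measure may differ.
    score = {c: 0 for c in candidates}
    for (a, b), w in weights.items():
        score[a] += w
        score[b] += w

    # Highest score first; ties broken alphabetically for determinism.
    return sorted(score.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical sample data, invented for illustration only.
sentences = [
    "Keyphrase extraction identifies phrases that represent a document.",
    "Graph centrality can weight each keyphrase in a co-occurrence graph.",
    "Concept linking maps a keyphrase to a concept before graph centrality is computed.",
]
candidates = ["keyphrase", "graph centrality", "concept linking", "document"]
ranking = rank_candidates(sentences, candidates)
```

In this toy input, "keyphrase" co-occurs with every other candidate and therefore ranks first. A fuller version would presumably combine such a centrality score with the additional heuristics the abstract mentions.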

Cite

Tosi, M. D. L., & dos Reis, J. C. (2019). C-Rank: A Concept Linking Approach to Unsupervised Keyphrase Extraction. In Communications in Computer and Information Science (Vol. 1057 CCIS, pp. 236–247). Springer. https://doi.org/10.1007/978-3-030-36599-8_21
