Incorporating expert knowledge into keyphrase extraction

Sujatha Das Gollapalli; Xiao Li Li; Peng Yang

Conference ProceedingsOPEN ACCESS

Incorporating expert knowledge into keyphrase extraction

31st AAAI Conference on Artificial Intelligence, AAAI 2017 (2017) 3180-3187

DOI: 10.1609/aaai.v31i1.10986

79Citations

57Readers

Abstract

Keyphrases that efficiently summarize a document's content are used in various document processing and retrieval tasks. Current state-of-the-art techniques for keyphrase extraction operate at a phrase-level and involve scoring candidate phrases based on features of their component words. In this paper, we learn keyphrase taggers for research papers using token-based features incorporating linguistic, surface-form, and document-structure information through sequence labeling. We experimentally illustrate that using within-document features alone, our tagger trained with Conditional Random Fields performs on-par with existing state-of-the-art systems that rely on information from Wikipedia and citation networks. In addition, we are also able to harness recent work on feature labeling to seamlessly incorporate expert knowledge and predictions from existing systems to enhance the extraction performance further. We highlight the modeling advantages of our keyphrase taggers and show significant performance improvements on two recently-compiled datasets of keyphrases from Computer Science research papers.

Cite

CITATION STYLE

APA

Das Gollapalli, S., Li, X. L., & Yang, P. (2017). Incorporating expert knowledge into keyphrase extraction. In 31st AAAI Conference on Artificial Intelligence, AAAI 2017 (pp. 3180–3187). AAAI press. https://doi.org/10.1609/aaai.v31i1.10986

Incorporating expert knowledge into keyphrase extraction

Abstract

Cite

Register to see more suggestions