A new domain independent keyphrase extraction system

13Citations
Citations of this article
27Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper we present a keyphrase extraction system that can extract potential phrases from a single document in an unsupervised, domain-independent way. We extract word n-grams from input document. We incorporate linguistic knowledge (i.e., part-of-speech tags), and statistical information (i.e., frequency, position, lifespan) of each n-gram in defining candidate phrases and their respective feature sets. The proposed approach can be applied to any document, however, in order to know the effectiveness of the system for digital libraries, we have carried out the evaluation on a set of scientific documents, and compared our results with current keyphrase extraction systems. © 2010 Springer-Verlag.

Cite

CITATION STYLE

APA

Pudota, N., Dattolo, A., Baruzzo, A., & Tasso, C. (2010). A new domain independent keyphrase extraction system. In Communications in Computer and Information Science (Vol. 91 CCIS, pp. 67–78). https://doi.org/10.1007/978-3-642-15850-6_8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free