Skip to content

Automatic keyphrase extraction with a refined candidate set

by Wei You, Dominique Fontaine, Jean Paul Barth??s
Proceedings - 2009 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2009 ()
Get full text at journal


In this paper, we develop and evaluate an automatic keyphrase extraction technique for scientific documents. A new candidate phrase generation method is proposed based on the core word expansion algorithm, which can reduce the size of candidate set by about 75% without increasing the computational complexity. Then in the step of feature calculation, when a phrase and its sub-phrases coexist as candidates, an inverse document frequency related feature is introduced for selecting the proper granularity. Experimental results show the efficiency and effectiveness of the refined candidate set and demonstrate that the overall performance of our system compares favorably with other known keyphrase extraction systems.

Cite this document (BETA)

Readership Statistics

8 Readers on Mendeley
by Discipline
75% Computer Science
13% Business, Management and Accounting
13% Engineering
by Academic Status
63% Student > Master
25% Student > Ph. D. Student
13% Researcher
by Country
13% South Africa
13% Sweden

Sign up today - FREE

Mendeley saves you time finding and organizing research. Learn more

  • All your research in one place
  • Add and import papers easily
  • Access it anywhere, anytime

Start using Mendeley in seconds!

Sign up & Download

Already have an account? Sign in