Naïve algorithms for keyphrase extraction and text summarization from a single document inspired by the protein biosynthesis process

Daniel Gayo-Avello; Darío Álvarez-Gutiérrez; José Gayo-Avello

Journal Article

Naïve algorithms for keyphrase extraction and text summarization from a single document inspired by the protein biosynthesis process

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3141 440-455

DOI: 10.1007/978-3-540-27835-1_32

11Citations

7Readers

Get full text

Abstract

Keywords are a simple way of describing a document, giving the reader some clues about its contents. However, sometimes they only categorize the text into a topic being more useful a summary. Keywords and abstracts are common in scientific and technical literature but most of the documents available (e.g., web pages) lack such help, so automatic keyword extraction and summarization tools are fundamental to fight against the "information overload" and improve the users' experience. Therefore, this paper describes a new technique to obtain keyphrases and summaries from a single document. With this technique, inspired by the process of protein biosynthesis, a sort of "document DNA" can be extracted and translated into a "significance protein" which both produces a set of keyphrases and acts on the document highlighting the most relevant passages. These ideas have been implemented into a prototype, publicly available in the Web, which has obtained really promising results. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Gayo-Avello, D., Álvarez-Gutiérrez, D., & Gayo-Avello, J. (2004). Naïve algorithms for keyphrase extraction and text summarization from a single document inspired by the protein biosynthesis process. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3141, 440–455. https://doi.org/10.1007/978-3-540-27835-1_32

Naïve algorithms for keyphrase extraction and text summarization from a single document inspired by the protein biosynthesis process

Abstract

Cite

Register to see more suggestions