Automatic annotation of corpora for text summarisation: A comparative study

2Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper presents two methods which automatically produce annotated corpora for text summarisation on the basis of human produced abstracts. Both methods identify a set of sentences from the document which conveys the information in the human produced abstract best. The first method relies on a greedy algorithm, whilst the second one uses a genetic algorithm. The methods allow to specify the number of sentences to be annotated, which constitutes an advantage over the existing methods. Comparison between the two approaches investigated here revealed that the genetic algorithm is appropriate in cases where the number of sentences to be annotated is less than the number of sentences in an ideal gold standard with no length restrictions, whereas the greedy algorithm should be used in other cases. © Springer-Verlag Berlin Heidelberg 2005.

Cite

CITATION STYLE

APA

Orǎsan, C. (2005). Automatic annotation of corpora for text summarisation: A comparative study. In Lecture Notes in Computer Science (Vol. 3406, pp. 670–681). https://doi.org/10.1007/978-3-540-30586-6_75

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free