Arabic/English multi-document summarization with CLASSY - The past and the future

40Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Automatic document summarization has become increasingly important due to the quantity of written material generated world-wide. Generating good quality summaries enables users to cope with larger amounts of information. English-document summarization is a difficult task. Yet it is not sufficient. Environmental, economic, and other global issues make it imperative for English speakers to understand how other countries and cultures perceive and react to important events. CLASSY (Clustering, Linguistics, And Statistics for Summarization Yield) is an automatic, extract-generating, summarization system that uses linguistic trimming and statistical methods to generate generic or topic(/query)-driven summaries for single documents or clusters of documents. CLASSY has performed well in the Document Understanding Conference (DUC) evaluations and the Multi-lingual (Arabic/English) Summarization Evaluations (MSE). We present a description of CLASSY. We follow this with experiments and results from the MSE evaluations and conclude with a discussion of on-going work to improve the quality of the summaries-both English-only and multi-lingual-that CLASSY generates. © 2008 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Schlesinger, J. D., O’Leary, D. P., & Conroy, J. M. (2008). Arabic/English multi-document summarization with CLASSY - The past and the future. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4919 LNCS, pp. 568–581). https://doi.org/10.1007/978-3-540-78135-6_49

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free