At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization


Abstract

Extractive methods have proven effective in automatic document summarization. Previous work performs this task by identifying informative content at the sentence level. However, it is unclear whether extraction at the sentence level is the best solution. In this work, we show that extracting full sentences introduces unnecessary and redundant content, and that extracting sub-sentential units is a promising alternative. Specifically, we propose extracting sub-sentential units based on the constituency parse tree. We present a neural extractive model that leverages sub-sentential information and extracts these units. Extensive experiments and analyses show that extracting sub-sentential units performs competitively compared to full-sentence extraction under both automatic and human evaluation. We hope our work provides some inspiration regarding the basic extraction units in extractive summarization for future research.
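To make the idea of sub-sentential units concrete, here is a minimal sketch of how a sentence might be cut into smaller units using its constituency parse. It is an illustration only, not the procedure from the paper: the choice of clause labels (`S`, `SBAR`), the rule of cutting at the outermost embedded clause, and the helper names `parse_bracketed` and `split_units` are all assumptions made for this example. The input is assumed to be a Penn Treebank style bracketed parse produced by some external parser.

```python
# Hypothetical sketch: split a sentence into sub-sentential units at
# clause boundaries of a bracketed constituency parse.
# ASSUMPTION: clause nodes are those labeled S or SBAR; the paper's
# actual unit definition may differ.
CLAUSE_LABELS = {"S", "SBAR"}


def parse_bracketed(text):
    """Parse a bracketed tree string into (label, children) tuples.

    Leaves (words) are plain strings.
    """
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read():
        nonlocal pos
        assert tokens[pos] == "(", "expected an opening bracket"
        pos += 1
        label = tokens[pos]
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(read())
            else:
                children.append(tokens[pos])
                pos += 1
        pos += 1  # consume the closing bracket
        return (label, children)

    return read()


def split_units(tree):
    """Return contiguous word groups, cutting at outermost embedded clauses."""
    units, current = [], []

    def flush():
        if current:
            units.append(" ".join(current))
            current.clear()

    def walk(node, is_root, inside_clause):
        if isinstance(node, str):
            current.append(node)
            return
        label, children = node
        # Cut only at clause nodes that are neither the root nor nested
        # inside a clause we have already cut at.
        boundary = label in CLAUSE_LABELS and not is_root and not inside_clause
        if boundary:
            flush()
        for child in children:
            walk(child, False, inside_clause or boundary)
        if boundary:
            flush()

    walk(tree, True, False)
    flush()
    return units


example = (
    "(S (NP (DT The) (NN company)) "
    "(VP (VBD said) (SBAR (IN that) (S (NP (NNS profits)) (VP (VBD rose))))) "
    "(. .))"
)
print(split_units(parse_bracketed(example)))
# → ['The company said', 'that profits rose', '.']
```

In this sketch, an extractive model would score and select the resulting units instead of whole sentences, so a summary could keep "that profits rose" without the rest of the sentence.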

Citation (APA)

Zhou, Q., Wei, F., & Zhou, M. (2020). At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 5617–5628). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.coling-main.492
