At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization

Qingyu Zhou; Furu Wei; Ming Zhou

Conference ProceedingsOPEN ACCESS

At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization

COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (2020) 5617-5628

DOI: 10.18653/v1/2020.coling-main.492

13Citations

92Readers

Abstract

Extractive methods have been proven effective in automatic document summarization. Previous works perform this task by identifying informative contents at sentence level. However, it is unclear whether performing extraction at sentence level is the best solution. In this work, we show that unnecessity and redundancy issues exist when extracting full sentences, and extracting sub-sentential units is a promising alternative. Specifically, we propose extracting sub-sentential units based on the constituency parsing tree. A neural extractive model which leverages the sub-sentential information and extracts them is presented. Extensive experiments and analyses show that extracting sub-sentential units performs competitively comparing to full sentence extraction under the evaluation of both automatic and human evaluations. Hopefully, our work could provide some inspiration of the basic extraction units in extractive summarization for future research.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Zhou, Q., Wei, F., & Zhou, M. (2020). At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 5617–5628). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.coling-main.492

Readers' Seniority

PhD / Post grad / Masters / Doc 21

64%

Researcher 8

24%

Lecturer / Post doc 3

Professor / Associate Prof. 1

Readers' Discipline

Computer Science 28

76%

Linguistics 5

14%

Engineering 3

Chemistry 1

At Which Level Should We Extract? An Empirical Analysis on Extractive Document Summarization

Abstract

References Powered by Scopus

Get to the point: Summarization with pointer-generator networks

LexRank: Graph-based lexical centrality as salience in text summarization

Neural summarization by extracting sentences and words

Cited by Powered by Scopus

Summarization of German Court Rulings

Exploring optimal granularity for extractive summarization of unstructured health records: Analysis of the largest multi-institutional archive of health records in Japan

Extractive Summarization of Chinese Judgment Documents via Sentence Embedding and Memory Network

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline