ScisummNet: A large annotated corpus and content-impact models for scientific paper summarization with citation networks

Michihiro Yasunaga; Jungo Kasai; Rui Zhang; Alexander R. Fabbri; Irene Li; Dan Friedman; Dragomir R. Radev

Conference Proceedings (Open Access)

33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (2019) 7386-7393

DOI: 10.1609/aaai.v33i01.33017386

176 Citations

128 Readers

Abstract

Scientific article summarization is challenging: large, annotated corpora are not available, and the summary should ideally include the article's impact on the research community. This paper provides novel solutions to these two challenges. We 1) develop and release the first large-scale manually annotated corpus for scientific papers (on computational linguistics) by enabling faster annotation, and 2) propose summarization methods that integrate the authors' original highlights (abstract) and the article's actual impact on the community (citations) to create comprehensive, hybrid summaries. We conduct experiments to demonstrate the efficacy of our corpus in training data-driven models for scientific paper summarization and the advantage of our hybrid summaries over abstracts and traditional citation-based summaries. Our large annotated corpus and hybrid methods provide a new framework for scientific paper summarization research.

Citation (APA)

Yasunaga, M., Kasai, J., Zhang, R., Fabbri, A. R., Li, I., Friedman, D., & Radev, D. R. (2019). ScisummNet: A large annotated corpus and content-impact models for scientific paper summarization with citation networks. In 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019 (pp. 7386–7393). AAAI Press. https://doi.org/10.1609/aaai.v33i01.33017386
