Keyphrase extraction as sequence labeling using contextualized embeddings

Dhruva Sahrawat; Debanjan Mahata; Haimin Zhang; Mayank Kulkarni; Agniv Sharma; Rakesh Gosangi; Amanda Stent; Yaman Kumar; Rajiv Ratn Shah; Roger Zimmermann

Conference ProceedingsOPEN ACCESS

Keyphrase extraction as sequence labeling using contextualized embeddings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12036 LNCS 328-335

DOI: 10.1007/978-3-030-45442-5_41

62Citations

33Readers

Get full text

Abstract

In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where the words in the input text are represented using deep contextualized embeddings. We evaluate the proposed architecture using both contextualized and fixed word embedding models on three different benchmark datasets, and compare with existing popular unsupervised and supervised techniques. Our results quantify the benefits of: (a) using contextualized embeddings over fixed word embeddings; (b) using a BiLSTM-CRF architecture with contextualized word embeddings over fine-tuning the contextualized embedding model directly; and (c) using domain-specific contextualized embeddings (SciBERT). Through error analysis, we also provide some insights into why particular models work better than the others. Lastly, we present a case study where we analyze different self-attention layers of the two best models (BERT and SciBERT) to better understand their predictions.

Author supplied keywords

Cite

CITATION STYLE

APA

Sahrawat, D., Mahata, D., Zhang, H., Kulkarni, M., Sharma, A., Gosangi, R., … Zimmermann, R. (2020). Keyphrase extraction as sequence labeling using contextualized embeddings. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12036 LNCS, pp. 328–335). Springer. https://doi.org/10.1007/978-3-030-45442-5_41

Keyphrase extraction as sequence labeling using contextualized embeddings

Abstract

Author supplied keywords

Cite

Register to see more suggestions