Citation field learning by RNN with limited training data

Yiqing Zhang; Yimeng Dai; Jianzhong Qi; Xinxing Xu; Rui Zhang

Conference Proceedings

Citation field learning by RNN with limited training data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11154 LNAI 219-232

DOI: 10.1007/978-3-030-04503-6_23

0Citations

2Readers

Get full text

Abstract

Citation field learning is to segment a citation string into fields of interest such as author, title, and venue from plain text. We are interested in citation field learning from researchers’ homepages. This task is challenging due to the free citation styles used by different creators of the homepages. We aim to address the challenge by neural network based approaches which learn the citation field styles automatically. Neural network based approaches are data-hungry, but manually labeled training data is expensive to obtain. Therefore, we propose a novel framework that utilizes auto-generated training data and domain adaptation to enhance a manually labeled training dataset of limited size. At the same time, we design an adaptive Recurrent Neural Network (RNN) to learn citation styles from the enhanced training data effectively. Extensive experiments show that the proposed methods outperform state-of-the-art methods for citation field learning.

Cite

CITATION STYLE

APA

Zhang, Y., Dai, Y., Qi, J., Xu, X., & Zhang, R. (2018). Citation field learning by RNN with limited training data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11154 LNAI, pp. 219–232). Springer Verlag. https://doi.org/10.1007/978-3-030-04503-6_23

Citation field learning by RNN with limited training data

Abstract

Cite

Register to see more suggestions