PubSE: A hierarchical model for publication extraction from academic homepages

4Citations
Citations of this article
80Readers
Mendeley users who have this article in their library.

Abstract

Publication information in a researcher's academic homepage provides insights about the researcher's expertise, research interests, and collaboration networks. We aim to extract all the publication strings from a given academic homepage. This is a challenging task because the publication strings in different academic homepages may be located at different positions with different structures. To capture the positional and structural diversity, we propose an end-to-end hierarchical model named PubSE based on Bi-LSTM-CRF. We further propose an alternating training method for training the model. Experiments on real data show that PubSE outperforms the state-of-the-art models by up to 11.8% in F1-score.

References Powered by Scopus

Convolutional neural networks for sentence classification

8026Citations
N/AReaders
Get full text

Hierarchical attention networks for document classification

4242Citations
N/AReaders
Get full text

Neural architectures for named entity recognition

2582Citations
N/AReaders
Get full text

Cited by Powered by Scopus

2ED: An efficient entity extraction algorithm using two-level edit-distance

7Citations
N/AReaders
Get full text

Joint recognition of names and publications in academic homepages

3Citations
N/AReaders
Get full text

AN ANCHOR LEARNING APPROACH FOR CITATION FIELD LEARNING

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Zhang, Y., Qi, J., Zhang, R., & Yin, C. (2018). PubSE: A hierarchical model for publication extraction from academic homepages. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018 (pp. 1005–1010). Association for Computational Linguistics. https://doi.org/10.18653/v1/d18-1123

Readers over time

‘18‘19‘20‘21‘22‘23‘24‘2506121824

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 21

66%

Researcher 7

22%

Professor / Associate Prof. 2

6%

Lecturer / Post doc 2

6%

Readers' Discipline

Tooltip

Computer Science 29

74%

Linguistics 6

15%

Engineering 2

5%

Business, Management and Accounting 2

5%

Save time finding and organizing research with Mendeley

Sign up for free
0