Author Name Disambiguation Based on Rule and Graph Model

Lizhi Zhang; Zhijie Ban

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12430 LNAI 617-628

DOI: 10.1007/978-3-030-60450-9_49

Abstract

Author name disambiguation has long been viewed as a challenging problem in scientific literature management, and with the substantial growth of the scientific literature, solving it has become increasingly difficult and urgent. In this paper, we study the author name disambiguation problem for large-scale collections of academic papers. Our method combines paper feature information with the relation information between papers. Building on Aminer’s disambiguation framework, we present a novel method for constructing the paper relation graph based on atomic clusters and propose an efficient post-processing algorithm that aims to improve disambiguation performance through rule-based clustering. This algorithm uses similarity features derived from metadata and implements two types of disambiguation rules. We carefully evaluate the proposed method on large real-world data, and the experimental results show that it achieves clearly better performance than state-of-the-art methods.
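The abstract does not spell out the two rule types or the similarity features, so the following is only a minimal sketch of what rule-based post-processing over existing clusters could look like. Everything in it is an assumption made for illustration: the function names `jaccard` and `merge_clusters`, the `coauthors` and `org` metadata fields, the 0.5 threshold, and both rules (coauthor-set overlap, and a shared organization plus at least one shared coauthor). It should not be read as the authors' algorithm.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def merge_clusters(clusters, papers, coauthor_threshold=0.5):
    """Merge clusters that satisfy either of two hypothetical rules.

    clusters: list of lists of paper ids (e.g. output of an earlier stage).
    papers:   dict mapping paper id -> {"coauthors": set, "org": str}.
    """
    parent = list(range(len(clusters)))  # union-find over cluster indices

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    def profile(cluster):
        # Aggregate metadata of all papers in a cluster.
        coauthors, orgs = set(), set()
        for pid in cluster:
            coauthors |= papers[pid]["coauthors"]
            if papers[pid]["org"]:
                orgs.add(papers[pid]["org"].strip().lower())
        return coauthors, orgs

    profiles = [profile(c) for c in clusters]
    for i, j in combinations(range(len(clusters)), 2):
        co_i, org_i = profiles[i]
        co_j, org_j = profiles[j]
        # Assumed rule 1: strong overlap between coauthor sets.
        rule_coauthor = jaccard(co_i, co_j) >= coauthor_threshold
        # Assumed rule 2: same organization and at least one shared coauthor.
        rule_org = bool(org_i & org_j) and bool(co_i & co_j)
        if rule_coauthor or rule_org:
            parent[find(j)] = find(i)

    merged = {}
    for idx, cluster in enumerate(clusters):
        merged.setdefault(find(idx), []).extend(cluster)
    return [sorted(group) for group in merged.values()]
```

Calling `merge_clusters(clusters, papers)` returns a new list of clusters in which any pair matched by either rule has been joined. The union-find structure makes merges transitive, so a chain of pairwise matches collapses into a single cluster.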

Cite

Zhang, L., & Ban, Z. (2020). Author Name Disambiguation Based on Rule and Graph Model. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12430 LNAI, pp. 617–628). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-60450-9_49
