ANDMC: An Algorithm for Author Name Disambiguation Based on Molecular Cross Clustering

Siyang Zhang; Xinhua E; Tao Huang; Fan Yang

Conference Proceedings

ANDMC: An Algorithm for Author Name Disambiguation Based on Molecular Cross Clustering

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11448 LNCS 173-185

DOI: 10.1007/978-3-030-18590-9_12

3Citations

15Readers

Get full text

Abstract

With the rapid development of information technology, the problem of name ambiguity has become one of the main problems in the fields of information retrieval, data mining and scientific measurement, which inevitably affects the accuracy of information calculations, reduces the credibility of the literature retrieval system, and affect the quality of information. To deal with this, name disambiguation technology has been proposed, which maps virtual relational networks to real social networks. However, most existing related work did not consider the problem of name coreference and the inability to correctly match due to the different writing formats between two same strings. This paper mainly proposes an algorithm for Author Name Disambiguation based on Molecular Cross Clustering (ANDMC) considering name coreference. Meanwhile, we explored the string matching algorithm called Improved Levenshtein Distance (ILD), which solves the problem of matching between two same strings with different writing format. The experimental results show that our algorithm outperforms the baseline method. (F1-score 9.48% 21.45% higher than SC and HAC).

Author supplied keywords

Cite

CITATION STYLE

APA

Zhang, S., E, X., Huang, T., & Yang, F. (2019). ANDMC: An Algorithm for Author Name Disambiguation Based on Molecular Cross Clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11448 LNCS, pp. 173–185). Springer Verlag. https://doi.org/10.1007/978-3-030-18590-9_12

ANDMC: An Algorithm for Author Name Disambiguation Based on Molecular Cross Clustering

Abstract

Author supplied keywords

Cite

Register to see more suggestions