Vietnamese author name disambiguation for integrating publications from heterogeneous sources

Tin Huynh; Kiem Hoang; Tien Do; Duc Huynh

Conference Proceedings

Vietnamese author name disambiguation for integrating publications from heterogeneous sources

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7802 LNAI(PART 1) 226-235

DOI: 10.1007/978-3-642-36546-1_24

12Citations

15Readers

Get full text

Abstract

Automatic integration of bibliographical data from various sources is a really critical task in the field of digital libraries. One of the most important challenges for this process is the author name disambiguation. In this paper, we applied supervised learning approach and proposed a set of features that can be used to assist training classifiers in disambiguating Vietnamese author names. In order to evaluate efficiency of the proposed features set, we did experiments on five supervised learning methods: Random Forest, Support Vector Machine (SVM), k-Nearest Neighbors (kNN), C4.5 (Decision Tree), Bayes. The experiment dataset collected from three online digital libraries such as Microsoft Academic Search, ACM Digital Library, IEEE Digital Library. Our experiments shown that kNN, Random Forest, C4.5 classifier outperform than the others. The average accuracy archived with kNN approximates 94.55%, random forest is 94.23%, C4.5 is 93.98%, SVM is 91.91% and Bayes is lowest with 81.56%. Summary, we archived the highest accuracy 98.39% for author name disambiguation problem with the proposed feature set in our experiments on the Vietnamese authors dataset. © 2013 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Huynh, T., Hoang, K., Do, T., & Huynh, D. (2013). Vietnamese author name disambiguation for integrating publications from heterogeneous sources. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7802 LNAI, pp. 226–235). https://doi.org/10.1007/978-3-642-36546-1_24

Vietnamese author name disambiguation for integrating publications from heterogeneous sources

Abstract

Author supplied keywords

Cite

Register to see more suggestions