A Novel Method for Resolving and Completing Authors' Country Affiliation Data in Bibliographic Records

9Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Our work seeks to overcome data quality issues related to incomplete author affiliation data in bibliographic records in order to support accurate and reliable measurement of international research collaboration (IRC). We propose, implement, and evaluate a method that leverages the Web-based knowledge graph Wikidata to resolve publication affiliation data to particular countries. The method is tested with general and domain-specific data sets. Our evaluation covers the magnitude of improvement, accuracy, and consistency. Results suggest the method is beneficial, reliable, and consistent, and thus a viable and improved approach to measuring IRC. Though our evaluation suggests the method works with both general and domain-specific bibliographic data sets, it may perform differently with data sets not tested here. Further limitations stem from the use of the R programming language and R libraries for country identification as well as imbalanced data coverage and quality in Wikidata that may also change over time. The new method helps to increase the accuracy in IRC studies and provides a basis for further development into a general tool that enriches bibliographic data using the Wikidata knowledge graph. This is the first attempt to enrich bibliographic data using a peer-produced, Web-based knowledge graph like Wikidata.

Cite

CITATION STYLE

APA

Nguyen, B. X., Dinneen, J. D., & Luczak-Roesch, M. (2020). A Novel Method for Resolving and Completing Authors’ Country Affiliation Data in Bibliographic Records. Journal of Data and Information Science, 5(3), 97–115. https://doi.org/10.2478/jdis-2020-0020

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free