Affiliation disambiguation for constructing semantic digital libraries

Yong Jiang; Hai Tao Zheng; Xinmin Wang; Binggan Lu; Kaihua Wu

Journal ArticleOPEN ACCESS

Affiliation disambiguation for constructing semantic digital libraries

Journal of the American Society for Information Science and Technology (2011) 62(6) 1029-1041

DOI: 10.1002/asi.21538

23Citations

36Readers

Get full text

Abstract

With increasing digital information availability, semantic web technologies have been employed to construct semantic digital libraries in order to ease information comprehension. The use of semantic web enables users to search or visualize resources in a semantic fashion. Semantic web generation is a key process in semantic digital library construction, which converts metadata of digital resources into semantic web data. Many text mining technologies, such as keyword extraction and clustering, have been proposed to generate semantic web data. However, one important type of metadata in publications, called affiliation, is hard to convert into semantic web data precisely because different authors, who have the same affiliation, often express the affiliation in different ways. To address this issue, this paper proposes a clustering method based on normalized compression distance for the purpose of affiliation disambiguation. The experimental results show that our method is able to identify different affiliations that denote the same institutes. The clustering results outperform the well-known k-means clustering method in terms of average precision, F-measure, entropy, and purity. © 2011 ASIS&T.

Cite

CITATION STYLE

APA

Jiang, Y., Zheng, H. T., Wang, X., Lu, B., & Wu, K. (2011). Affiliation disambiguation for constructing semantic digital libraries. Journal of the American Society for Information Science and Technology, 62(6), 1029–1041. https://doi.org/10.1002/asi.21538

Affiliation disambiguation for constructing semantic digital libraries

Abstract

Cite

Register to see more suggestions