Author disambiguation is a prerequisite for using bibliographic metadata in citation analysis. Automatic disambiguation algorithms mostly rely on cluster-based strategies to identify unique authors from their names and publications. However, most approaches assume that the correct number of unique authors is known a priori, which is rarely the case in real-world settings. In this publication, we analyse cluster-based disambiguation strategies and develop a model selection method that estimates the number of distinct authors based on co-authorship networks. We show that, given clean textual features, the developed model selection method provides accurate estimates of the number of unique authors. © 2011 IEEE.
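To make the idea of estimating the author count from a co-authorship network concrete, the following is a minimal sketch. It is not the model selection method of the paper, whose details the abstract does not give. It assumes a much simpler heuristic: two publications carrying the same ambiguous name are attributed to the same person if they share at least one other co-author, and the number of resulting groups serves as the estimate. The function name `estimate_author_count` and the sample data are hypothetical.

```python
from itertools import combinations


def estimate_author_count(papers, ambiguous_name):
    """Estimate how many distinct people share `ambiguous_name`.

    `papers` is a list of sets of author names, each containing the
    ambiguous name. Assumption (illustrative only): papers that share
    at least one other co-author belong to the same person.
    """
    # Co-authors of each paper, excluding the ambiguous name itself.
    coauthors = [set(p) - {ambiguous_name} for p in papers]

    # Union-find over paper indices.
    parent = list(range(len(papers)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # Merge every pair of papers with overlapping co-author sets.
    for i, j in combinations(range(len(papers)), 2):
        if coauthors[i] & coauthors[j]:
            parent[find(i)] = find(j)

    # Each remaining root corresponds to one estimated author.
    return len({find(i) for i in range(len(papers))})


# Hypothetical sample data: four papers signed "J. Smith".
papers = [
    {"J. Smith", "A. Lee"},
    {"J. Smith", "A. Lee", "B. Kumar"},
    {"J. Smith", "C. Novak"},
    {"J. Smith"},
]
estimated = estimate_author_count(papers, "J. Smith")
print(estimated)
```

In this sketch, single-author papers always form their own group, so the heuristic tends to overestimate the count when co-authorship information is sparse. That is one reason a method combining the network with textual features, as the abstract describes, would be preferable in practice.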
Kern, R., Zechner, M., & Granitzer, M. (2011). Model selection strategies for author disambiguation. In Proceedings - International Workshop on Database and Expert Systems Applications, DEXA (pp. 155–159). https://doi.org/10.1109/DEXA.2011.54