A comparison of metadata extraction techniques for crowdsourced bibliographic metadata management

Michael Granitzer; Maya Hristakeva; Kris Jack; Robert Knight

Conference Proceedings

A comparison of metadata extraction techniques for crowdsourced bibliographic metadata management

Proceedings of the ACM Symposium on Applied Computing (2012) 962-964

DOI: 10.1145/2245276.2245462

14Citations

98Readers

Get full text

Abstract

Social research networks such as Mendeley and CiteULike offer various services for collaboratively managing bibliographic metadata and uploading textual artifacts. One core problem thereby is the extraction of bibliographic metadata from the textual artifacts. Our work investiages the use of Conditional Random Fields and Support Vector Machines, implemented in two state-of-the-art real-world systems, namely ParsCit and the Mendeley Desktop, for automatically extracting bibliographic metadata. We compare the systems' accuracy on two newly created real-world data sets gathered from Mendeley and Linked-Open-Data repositories. Our analysis shows that two-stage SVMs provide reasonable performance in solving the challenge of metadata extraction from user-provided textual artifacts. © 2012 Authors.

Author supplied keywords

Cite

CITATION STYLE

APA

Granitzer, M., Hristakeva, M., Jack, K., & Knight, R. (2012). A comparison of metadata extraction techniques for crowdsourced bibliographic metadata management. In Proceedings of the ACM Symposium on Applied Computing (pp. 962–964). https://doi.org/10.1145/2245276.2245462

A comparison of metadata extraction techniques for crowdsourced bibliographic metadata management

Abstract

Author supplied keywords

Cite

Register to see more suggestions