LODStats - An extensible framework for high-performance dataset analytics

125Citations
Citations of this article
69Readers
Mendeley users who have this article in their library.
Get full text

Abstract

One of the major obstacles for a wider usage of web data is the difficulty to obtain a clear picture of the available datasets. In order to reuse, link, revise or query a dataset published on the Web it is important to know the structure, coverage and coherence of the data. In order to obtain such information we developed LODStats - a statement-stream-based approach for gathering comprehensive statistics about datasets adhering to the Resource Description Framework (RDF). LODStats is based on the declarative description of statistical dataset characteristics. Its main advantages over other approaches are a smaller memory footprint and significantly better performance and scalability. We integrated LODStats with the CKAN dataset metadata registry and obtained a comprehensive picture of the current state of a significant part of the Data Web. © 2012 Springer-Verlag.

Cite

CITATION STYLE

APA

Auer, S., Demter, J., Martin, M., & Lehmann, J. (2012). LODStats - An extensible framework for high-performance dataset analytics. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7603 LNAI, pp. 353–362). https://doi.org/10.1007/978-3-642-33876-2_31

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free