A snapshot of the OWL web

Nicolas Matentzoglu; Samantha Bail; Bijan Parsia

Conference ProceedingsOPEN ACCESS

A snapshot of the OWL web

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 8218 LNCS(PART 1) 331-346

DOI: 10.1007/978-3-642-41335-3_21

28Citations

25Readers

Abstract

Tool development for and empirical experimentation in OWL ontology engineering require a wide variety of suitable ontologies as input for testing and evaluation purposes and detailed characterisations of real ontologies. Empirical activities often resort to (somewhat arbitrarily) hand curated corpora available on the web, such as the NCBO BioPortal and the TONES Repository, or manually selected sets of well-known ontologies. Findings of surveys and results of benchmarking activities may be biased, even heavily, towards these datasets. Sampling from a large corpus of ontologies, on the other hand, may lead to more representative results. Current large scale repositories and web crawls are mostly uncurated and suffer from duplication, small and (for many purposes) uninteresting ontology files, and contain large numbers of ontology versions, variants, and facets, and therefore do not lend themselves to random sampling. In this paper, we survey ontologies as they exist on the web and describe the creation of a corpus of OWL DL ontologies using strategies such as web crawling, various forms of de-duplications and manual cleaning, which allows random sampling of ontologies for a variety of empirical applications. © 2013 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Matentzoglu, N., Bail, S., & Parsia, B. (2013). A snapshot of the OWL web. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8218 LNCS, pp. 331–346). https://doi.org/10.1007/978-3-642-41335-3_21

A snapshot of the OWL web

Abstract

Author supplied keywords

Cite

Register to see more suggestions