With more and more data being published on the Web as Linked Data, Web Data quality is becoming increasingly important. While quite some work has been done with regard to quality assessment of Linked Data, only few works have addressed quality improvement. In this article, we present a preliminary an approach for identifying potentially incorrect RDF statements using distance-based outlier detection. Our method follows a three stage approach, which automates the whole process of finding potentially incorrect statements for a certain property. Our preliminary evaluation shows that a high precision is maintained with different settings.
CITATION STYLE
Debattista, J., Lange, C., & Auer, S. (2016). A preliminary investigation towards improving linked data quality using distance-based outlier detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10055 LNCS, pp. 116–124). Springer Verlag. https://doi.org/10.1007/978-3-319-50112-3_9
Mendeley helps you to discover research relevant for your work.