Assessing trust with pageRank in the web of data

José M. Giménez-García; Harsh Thakkar; Antoine Zimmermann

Conference ProceedingsOPEN ACCESS

Assessing trust with pageRank in the web of data

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9989 LNCS 293-307

DOI: 10.1007/978-3-319-47602-5_45

2Citations

9Readers

Abstract

While a number of quality metrics have been successfully proposed for datasets in the Web of Data, there is a lack of trust metrics that can be computed for any given dataset. We argue that reuse of data can be seen as an act of trust. In the Semantic Web environment, datasets regularly include terms from other sources, and each of these connections express a degree of trust on that source. However, determining what is a dataset in this context is not straightforward. We study the concepts of dataset and dataset link, to finally use the concept of Pay-Level Domain to differentiate datasets, and consider usage of external terms as connections among them. Using these connections we compute the PageRank value for each dataset, and examine the influence of ignoring predicates for computation. This process has been performed for more than 300 datasets, extracted from the LOD Laundromat. The results show that reuse of a dataset is not correlated with its size, and provide some insight on the limitations of the approach and ways to improve its efficacy.

Author supplied keywords

Cite

CITATION STYLE

APA

Giménez-García, J. M., Thakkar, H., & Zimmermann, A. (2016). Assessing trust with pageRank in the web of data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9989 LNCS, pp. 293–307). Springer Verlag. https://doi.org/10.1007/978-3-319-47602-5_45

Assessing trust with pageRank in the web of data

Abstract

Author supplied keywords

Cite

Register to see more suggestions