A self-training approach for resolving object coreference on the semantic web

Wei Hu; Jianfeng Chen; Yuzhong Qu

Conference Proceedings

A self-training approach for resolving object coreference on the semantic web

Proceedings of the 20th International Conference on World Wide Web, WWW 2011 (2011) 87-96

DOI: 10.1145/1963405.1963421

113Citations

41Readers

Get full text

Abstract

An object on the Semantic Web is likely to be denoted with multiple URIs by different parties. Object coreference resolution is to identify "equivalent" URIs that denote the same object. Driven by the Linking Open Data (LOD) initiative, millions of URIs have been explicitly linked with owl:sameAs statements, but potentially coreferent ones are still considerable. Existing approaches address the problem mainly from two directions: one is based upon equivalence inference mandated by OWL semantics, which finds semantically coreferent URIs but probably omits many potential ones; the other is via similarity computation between property-value pairs, which is not always accurate enough. In this paper, we propose a self-training approach for object coreference resolution on the Semantic Web, which leverages the two classes of approaches to bridge the gap between semantically coreferent URIs and potential candidates. For an object URI, we firstly establish a kernel that consists of semantically coreferent URIs based on owl:sameAs, (inverse) functional properties and (max-)cardinalities, and then extend such kernel iteratively in terms of discriminative property-value pairs in the descriptions of URIs. In particular, the discriminability is learnt with a statistical measurement, which not only exploits key characteristics for representing an object, but also takes into account the matchability between properties from pragmatics. In addition, frequent property combinations are mined to improve the accuracy of the resolution. We implement a scalable system and demonstrate that our approach achieves good precision and recall for resolving object coreference, on both benchmark and large-scale datasets. Copyright © 2011 by the Association for Computing Machinery, Inc. (ACM).

Author supplied keywords

Cite

CITATION STYLE

APA

Hu, W., Chen, J., & Qu, Y. (2011). A self-training approach for resolving object coreference on the semantic web. In Proceedings of the 20th International Conference on World Wide Web, WWW 2011 (pp. 87–96). https://doi.org/10.1145/1963405.1963421

A self-training approach for resolving object coreference on the semantic web

Abstract

Author supplied keywords

Cite

Register to see more suggestions