Using semantics and statistics to turn data into knowledge

Jay Pujara; Hui Miao; Lise Getoor; William W. Cohen

Journal ArticleOPEN ACCESS

Using semantics and statistics to turn data into knowledge

AI Magazine (2015) 36(1) 65-74

DOI: 10.1609/aimag.v36i1.2568

19Citations

56Readers

Abstract

Many information-extraction and knowledge base construction systems are addressing the challenge of deriving knowledge from text. A key problem in constructing these knowledge bases from sources like the web is overcoming the erroneous and incomplete information found in millions of candidate extractions. To solve this problem, we turn to semantics - using ontological constraints between candidate facts to eliminate errors. In this article, we represent the desired knowledge base as a knowledge graph and introduce the problem of knowledge graph identification, collectively resolving the entities, labels, and relations present in the knowledge graph. Knowledge graph identification requires reasoning jointly over millions of extractions simultaneously, posing a scalability challenge to many approaches. We use probabilistic soft logic (PSL), a recently introduced statistical relational learning framework, to implement an efficient solution to knowledge graph identification and present state-of-the-art results for knowledge graph construction while performing an order of magnitude faster than competing methods.

Cite

CITATION STYLE

APA

Pujara, J., Miao, H., Getoor, L., & Cohen, W. W. (2015). Using semantics and statistics to turn data into knowledge. AI Magazine, 36(1), 65–74. https://doi.org/10.1609/aimag.v36i1.2568

Using semantics and statistics to turn data into knowledge

Abstract

Cite

Register to see more suggestions