CN-DBpedia: A never-ending chinese knowledge extraction system

Bo Xu; Yong Xu; Jiaqing Liang; Chenhao Xie; Bin Liang; Wanyun Cui; Yanghua Xiao

Conference Proceedings

CN-DBpedia: A never-ending chinese knowledge extraction system

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10351 LNCS 428-438

DOI: 10.1007/978-3-319-60045-1_44

217Citations

76Readers

Get full text

Abstract

Great efforts have been dedicated to harvesting knowledge bases from online encyclopedias. These knowledge bases play important roles in enabling machines to understand texts. However, most current knowledge bases are in English and non-English knowledge bases, especially Chinese ones, are still very rare. Many previous systems that extract knowledge from online encyclopedias, although are applicable for building a Chinese knowledge base, still suffer from two challenges. The first is that it requires great human efforts to construct an ontology and build a supervised knowledge extraction model. The second is that the update frequency of knowledge bases is very slow. To solve these challenges, we propose a never-ending Chinese Knowledge extraction system, CN-DBpedia, which can automatically generate a knowledge base that is of ever-increasing in size and constantly updated. Specially, we reduce the human costs by reusing the ontology of existing knowledge bases and building an end-to-end facts extraction model. We further propose a smart active update strategy to keep the freshness of our knowledge base with little human costs. The 164 million API calls of the published services justify the success of our system.

Cite

CITATION STYLE

APA

Xu, B., Xu, Y., Liang, J., Xie, C., Liang, B., Cui, W., & Xiao, Y. (2017). CN-DBpedia: A never-ending chinese knowledge extraction system. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10351 LNCS, pp. 428–438). Springer Verlag. https://doi.org/10.1007/978-3-319-60045-1_44

CN-DBpedia: A never-ending chinese knowledge extraction system

Abstract

Cite

Register to see more suggestions