Unsupervised conceptualization and semantic text indexing for information extraction

Eugen Ruppert

Conference ProceedingsOPEN ACCESS

Unsupervised conceptualization and semantic text indexing for information extraction

Ruppert E

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9678 853-862

DOI: 10.1007/978-3-319-34129-3_54

1Citations

4Readers

Abstract

The goal of my thesis is the extension of the Distributional Hypothesis [13] from the word to the concept level. This will be achieved by creating data-driven methods to create and apply conceptualizations, taxonomic semantic models that are grounded in the input corpus. Such conceptualizations can be used to disambiguate all words in the corpus, so that we can extract richer relations and create a dense graph of semantic relations between concepts. These relations will reduce sparsity issues, a common problem for contextualization techniques. By extending our conceptualization with named entities and multi-word entities (MWE), we can create a Linked Open Data knowledge base that is linked to existing knowledge bases like Freebase.

Cite

CITATION STYLE

APA

Ruppert, E. (2016). Unsupervised conceptualization and semantic text indexing for information extraction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9678, pp. 853–862). Springer Verlag. https://doi.org/10.1007/978-3-319-34129-3_54

Unsupervised conceptualization and semantic text indexing for information extraction

Abstract

Cite

Register to see more suggestions