CRISOL: An approach for automatically populating semantic web from unstructured text collections

Roxana Danger; Rafael Berlanga; José Ruíz-Shulcloper

Journal Article

CRISOL: An approach for automatically populating semantic web from unstructured text collections

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3180 243-252

DOI: 10.1007/978-3-540-30075-5_24

3Citations

7Readers

Get full text

Abstract

Currently, the main drawback for the development of the Semantic Web stems from the manual tagging of web pages according to a given ontology that conceptualizes its domain. This tasks is usually hard, even for experts, and it is prone to errors due to the different interpretations users can have about the same documents. In this paper we address the problem of automatically generating ontology instances starting from a collection of unstructured documents (e.g. plain texts, HTML pages, etc.). These instances will populate the Semantic Web that is described by the ontology. The proposed approach combines Information Extraction techniques, mainly entity recognition, information merging and Text Mining techniques. This approach has been successfully applied in the development of a Semantic Web for the Archaeology Research. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Danger, R., Berlanga, R., & Ruíz-Shulcloper, J. (2004). CRISOL: An approach for automatically populating semantic web from unstructured text collections. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3180, 243–252. https://doi.org/10.1007/978-3-540-30075-5_24

CRISOL: An approach for automatically populating semantic web from unstructured text collections

Abstract

Cite

Register to see more suggestions