Domain ontologies are usually built by domain expert manually. They are accurate and professional from the perspective of domain dependent concepts, instances and relations among them, nevertheless, maintaining and creating new ontologies need too much manual work, especially when the ontology goes to large scale. Semi-structured data usually contain some semantic relations for concepts and instances, and there are many domain ontologies implicitly exist in these types of data sources. In this paper, we investigate automatic hierarchical domain ontology generation from semi-structured data, more specifically, from HTML and XML documents. The main process of our work includes domain terms extraction, pruning, union and hierarchical structure representation. We illustrate our study based on Artificial Intelligence related conference data represented in HTML and XML documents. © 2011 Springer-Verlag.
CITATION STYLE
Wang, S., Zeng, Y., & Zhong, N. (2011). Ontology extraction and integration from semi-structured data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6890 LNCS, pp. 39–48). https://doi.org/10.1007/978-3-642-23620-4_8
Mendeley helps you to discover research relevant for your work.