A web table extraction method based on structure and ontology


Abstract

Table extraction is an important problem in web page information analysis. At present, there are three main approaches: constructing a wrapper, constructing an ontology, and directly analyzing the structure of a table on the web page. These methods are usually applied independently. To address the shortcomings of any single method, this paper presents a combined method based on ontology and structure. We first locate the tables using heuristic rules, then analyze the table structure according to the labels and a title ontology, and finally extract and save the table data on the basis of the obtained characteristics. Experiments show that introducing the ontology greatly improves the accuracy of table structure recognition, and that the method achieves better precision and recall.
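The three stages named in the abstract (locate, analyze structure, extract) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no rules or ontology contents, so everything below is an assumption made for illustration. In particular, `TITLE_ONTOLOGY`, the row and column thresholds in `is_data_table`, and the reading of "labels" as HTML `<th>` tags are hypothetical. The sketch also assumes well-formed, non-nested tables and uses only Python's standard `html.parser` module.

```python
from html.parser import HTMLParser

# Hypothetical title ontology: canonical attribute -> header wordings (assumed).
TITLE_ONTOLOGY = {
    "name": {"name", "product", "item"},
    "price": {"price", "cost"},
    "date": {"date", "time"},
}


class TableCollector(HTMLParser):
    """Collect each <table> as a list of rows; a row is a list of (tag, text)."""

    def __init__(self):
        super().__init__()
        self.tables = []
        self._row = None
        self._cell = None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = [tag, ""]

    def handle_data(self, data):
        if self._cell is not None:
            self._cell[1] += data

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append((self._cell[0], self._cell[1].strip()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def is_data_table(rows, min_rows=2, min_cols=2):
    """Stage 1 (assumed heuristic): enough rows, all with the same column count."""
    if len(rows) < min_rows:
        return False
    widths = {len(row) for row in rows}
    return len(widths) == 1 and widths.pop() >= min_cols


def map_header(text):
    """Return the canonical attribute for a header text, or None if unknown."""
    word = text.lower()
    for attr, forms in TITLE_ONTOLOGY.items():
        if word in forms:
            return attr
    return None


def find_header(rows):
    """Stage 2: first row that is all <th> or mostly matches the ontology."""
    for i, row in enumerate(rows):
        attrs = [map_header(text) for _, text in row]
        hits = sum(a is not None for a in attrs)
        if all(tag == "th" for tag, _ in row) or hits * 2 > len(row):
            # Fall back to the raw header text when the ontology has no match.
            return i, [a or text for a, (_, text) in zip(attrs, row)]
    return None, []


def extract_tables(html):
    """Stage 3: return one list of records (dicts) per recognized data table."""
    parser = TableCollector()
    parser.feed(html)
    results = []
    for rows in parser.tables:
        if not is_data_table(rows):
            continue
        idx, header = find_header(rows)
        if idx is None:
            continue
        results.append(
            [dict(zip(header, (text for _, text in row))) for row in rows[idx + 1:]]
        )
    return results


SAMPLE_HTML = """
<table><tr><td>Home | About</td></tr></table>
<table>
  <tr><td>Product</td><td>Cost</td></tr>
  <tr><td>Pen</td><td>1.20</td></tr>
  <tr><td>Ink</td><td>3.50</td></tr>
</table>
"""
```

Calling `extract_tables(SAMPLE_HTML)` skips the first, single-cell layout table and returns the records of the second. Its header row uses plain `<td>` cells, so it is recognized only through the ontology lookup, which is the kind of case where a purely structural analysis would have no signal to rely on.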

Citation (APA)

Guo, C., Ma, S., & Yuan, D. (2014). A web table extraction method based on structure and ontology. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8933, 695–704. https://doi.org/10.1007/978-3-319-14717-8_55
