A web table extraction method based on structure and ontology

Cai Guo; Shun Ma; Dingrong Yuan

Journal Article

A web table extraction method based on structure and ontology

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2014) 8933 695-704

DOI: 10.1007/978-3-319-14717-8_55

2Citations

1Readers

Get full text

Abstract

The table extraction is an important issue of Webpage information analysis. At present, there are three mainly methods, which is how to construct the wrapper, how to construct the ontology and directly analysis the structure of a table on the webpage. In the process of analysis, usually these methods are applied independently. Aiming at the shortcomings of single method, this paper presents a synthetic method based on the ontology and structure. In this paper, we firstly locates the tables based on heuristic rules, and then analysis the table structure according to the label and the title ontology, at last extract and save the table data on the basis of the obtained characteristics. The experiments show that the introduction of the ontology greatly improved the accuracy of table structure recognition, and the precision and recall of the methods are better.

Author supplied keywords

Cite

CITATION STYLE

APA

Guo, C., Ma, S., & Yuan, D. (2014). A web table extraction method based on structure and ontology. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8933, 695–704. https://doi.org/10.1007/978-3-319-14717-8_55

A web table extraction method based on structure and ontology

Abstract

Author supplied keywords

Cite

Register to see more suggestions