Modeling and extracting deep-web query interfaces

23Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Interface modeling & extraction is a fundamental step in building a uniform query interface to a multitude of databases on the Web. Existing solutions are limited in that they assume interfaces are flat and thus ignore the inherent structure of interfaces, which then seriously hampers the effectiveness of interface integration. To address this limitation, in this chapter, we model an interface with a hierarchical schema (e.g., an ordered-tree of attributes). We describe ExQ, a novel schema extraction system with two distinct features. First, ExQ discovers the structure of an interface based on its visual representation via spatial clustering. Second, ExQ annotates the discovered schema with labels from the interface by imitating the human-annotation process. ExQ has been extensively evaluated with real-world query interfaces in five different domains and the results show that ExQ achieves above 90% accuracy rate in both structure discovery & schema annotation tasks. © 2009 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Wu, W., Doan, A. H., Yu, C., & Meng, W. (2009). Modeling and extracting deep-web query interfaces. Studies in Computational Intelligence, 251, 65–90. https://doi.org/10.1007/978-3-642-04141-9_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free