Exploration of deep web repositories

6Citations
Citations of this article
23Readers
Mendeley users who have this article in their library.
Get full text

Abstract

With the proliferation of online repositories (e.g., databases or document corpora) hidden behind proprietary web interfaces, e.g., keyword-/form-based search and hierarchical/graph-based browsing interfaces, efficient ways of exploring contents in such hidden repositories are of increasing importance. There are two key challenges: one on the proper understanding of interfaces, and the other on the efficient exploration, e.g., crawling, sampling and analytical processing, of very large repositories. In this tutorial, we focus on the fundamental developments in the field, including web interface understanding, crawling, sampling, and data analytics over web repositories with various types of interfaces and containing structured or unstructured data. Our goal is to encourage audience to initiate their own research in these exciting areas. © 2011 VLDB Endowment.

Cite

CITATION STYLE

APA

Zhang, N., & Das, G. (2011). Exploration of deep web repositories. Proceedings of the VLDB Endowment, 4(12), 1506–1507. https://doi.org/10.14778/3402755.3402808

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free