Web-scale data integration: You can only afford to pay as you go

Abstract

The World Wide Web is witnessing an increase in the amount of structured content: vast heterogeneous collections of structured data are on the rise due to the Deep Web, annotation schemes like Flickr, and sites like Google Base. While this phenomenon is creating an opportunity for structured data management, dealing with heterogeneity at web scale presents many new challenges. In this paper, we highlight these challenges in two scenarios: the Deep Web and Google Base. We contend that traditional data integration techniques are no longer valid in the face of such heterogeneity and scale. We propose a new data integration architecture, PAYGO, which is inspired by the concept of dataspaces and emphasizes pay-as-you-go data management as a means of achieving web-scale data integration.

Madhavan, J., Jeffery, S. R., Cohen, S., Dong, X., Ko, D., Yu, C., & Halevy, A. (2007). Web-scale data integration: You can only afford to pay as you go. In CIDR 2007 - 3rd Biennial Conference on Innovative Data Systems Research (pp. 342–350).
