Semistructured data

281Citations
Citations of this article
212Readers
Mendeley users who have this article in their library.

Abstract

In semistructured data, the information that is normally associated with a schema is contained within the data, which is sometimes called `self-describing'. In some forms of semistructured data there is no separate schema, in others it exists but only places loose constraints on the data. Semistructured data has recently emerged as an important topic of study for a variety of reasons. First, there are data sources such as the Web, which we would like to treat as databases but which cannot be constrained by a schema. Second, it may be desirable to have an extremely flexible format for data exchange between disparate databases. Third, even when dealing with structured data, it may be helpful to view it as semistructured for the purposes of browsing. This tutorial will cover a number of issues surrounding such data: finding a concise formulation, building a sufficiently expressive language for querying and transformation, and optimization problems.

Cite

CITATION STYLE

APA

Buneman, P. (1997). Semistructured data. In Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems - PODS (pp. 117–121). ACM. https://doi.org/10.1145/263661.263675

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free