Data product configuration management and versioning in large-scale production of satellite scientific data

4Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper describes a formal structure for keeping track of files, source code, scripts, and related material for large-scale Earth science data production. We first describe the environment and processes that govern this configuration management problem. Then, we show that a graph with typed nodes and arcs can describe the derivation of production design and of the produced files and their metadata. The graph provides three useful by-products: • a hierarchical data file inventory structure that can help system users find particular files, • methods for creating production graphs that govern job scheduling and provenance graphs that track all of the sources and transformations between raw data input and a particular output file, • a systematic relationship between different elements of the structure and development documentation. © Springer-Verlag Berlin Heidelberg 2003.

Cite

CITATION STYLE

APA

Barkstrom, B. R. (2003). Data product configuration management and versioning in large-scale production of satellite scientific data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2649, 118–133. https://doi.org/10.1007/3-540-39195-9_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free