High-Performance Remote Access to Climate Simulation Data: A Challenge Problem for Data Grid Technologies

Abstract

In numerous scientific disciplines, terabyte- and soon petabyte-scale data collections are emerging as critical community resources. A new class of Data Grid infrastructure is required to support the management, transport, distributed access, and analysis of these datasets by potentially thousands of users. Researchers who face this challenge include the climate modeling community, which performs long-duration computations accompanied by frequent output of very large files that must be further analyzed. We describe the Earth System Grid prototype, which brings together advanced analysis, replica management, data transfer, request management, and other technologies to support high-performance, interactive analysis of replicated data. We present performance results that demonstrate our ability to manage the location and movement of large datasets from the user's desktop. We report on experiments conducted over SCinet at SC'2000, where we achieved peak performance of 1.55 Gb/s and sustained performance of 512.9 Mb/s for data transfers between Texas and California.
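To give the reported rates a sense of scale, the following minimal sketch estimates how long a dataset would take to move at the peak and sustained figures quoted in the abstract. The 1 TB dataset size, the decimal definition of a terabyte, and the helper name `transfer_seconds` are illustrative assumptions, not values or code from the paper; protocol overhead and disk limits are ignored.

```python
# Back-of-the-envelope transfer-time estimate using the rates in the abstract.
# Assumptions (not from the paper): a 1 TB dataset, decimal units
# (1 TB = 10**12 bytes), and no protocol or storage overhead.

PEAK_BPS = 1.55e9        # peak rate from the abstract, bits per second
SUSTAINED_BPS = 512.9e6  # sustained rate from the abstract, bits per second
ONE_TERABYTE = 1e12      # hypothetical dataset size, bytes


def transfer_seconds(size_bytes: float, rate_bits_per_s: float) -> float:
    """Return the idealized time in seconds to move size_bytes at the given rate."""
    if rate_bits_per_s <= 0:
        raise ValueError("rate must be positive")
    return size_bytes * 8 / rate_bits_per_s


if __name__ == "__main__":
    for label, rate in (("peak", PEAK_BPS), ("sustained", SUSTAINED_BPS)):
        hours = transfer_seconds(ONE_TERABYTE, rate) / 3600
        print(f"1 TB at {label} rate: {hours:.2f} hours")
    # Fraction of the peak rate that was held over the sustained measurement.
    print(f"sustained / peak: {SUSTAINED_BPS / PEAK_BPS:.2%}")
```

Under these assumptions, the sustained rate is roughly a third of the peak, so planning estimates for bulk transfers would more reasonably be based on the sustained figure.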

Cite

Allcock, B., Foster, I., Nefedova, V., Chervenak, A., Deelman, E., Kesselman, C., … Williams, D. (2001). High-Performance Remote Access to Climate Simulation Data: A Challenge Problem for Data Grid Technologies. In Proceedings of the International Conference on Supercomputing (p. 46). Association for Computing Machinery. https://doi.org/10.1145/582034.582080