Towards a smart, internet-scale cache service for data intensive scientific applications

4Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Data and services provided by shared facilities, such as large-scale observing facilities, have become important enablers of scientific insights and discoveries across many science and engineering disciplines. Ensuring satisfactory quality of service can be challenging for facilities, due to their remote locations and to the distributed nature of the instruments, observatories, and users, as well as the rapid growth of data volumes and rates. This research explores how knowledge of the facilities usage patterns, coupled with emerging cyberinfrastructures can be leveraged to improve their performance, usability, and scientific impact. We propose a framework with a smart, internet-scale cache augmented with prefetching and data placement strategies to improve data delivery performance for scientific facilities. Our evaluations, which are based on the NSF Ocean Observatories Initiative, demonstrate that our framework is able to predict user requests and reduce data movements by more than 56% across networks.

Cite

CITATION STYLE

APA

Qin, Y., Simonet, A., Davis, P. E., Nouri, A., Wang, Z., Parashar, M., & Rodero, I. (2019). Towards a smart, internet-scale cache service for data intensive scientific applications. In ScienceCloud 2019 - Proceedings of the 10th Workshop on Scientific Cloud Computing, co-located with HPDC 2019 (pp. 11–18). Association for Computing Machinery, Inc. https://doi.org/10.1145/3322795.3331464

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free