Accelerating Experimental Science Using Jupyter and NERSC HPC

5Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Large scale experimental science workflows require support for a unified, interactive, real-time platform that can manage a distributed set of resources connected to High Performance Computing (HPC) systems. What is needed is a tool that provides the ease-of-use and interactivity of a web science gateway, while providing the scientist the ability to build custom, ad-hoc workflows in a composable way. The Jupyter platform can play a key role here to enable the ingestion and analysis of real-time streaming data, integrate with HPC resources in a closed-loop, and enable interactive ad-hoc analyses with running workflows. We want to enable high-quality reproducible human-in-the-loop science using HPC and Jupyter at the National Energy Research Scientific Computing Center (NERSC). Achieving that goal is challenging in the general case because scientific workflows and data can vary significantly in size and type between disciplines. There are many areas of work to achieve highly reproducible science, let alone human-in-the-loop interactive scientific workflows, but we focus here on some basic elements for enabling an improved interactive HPC experience including creating reusable recipes and workflows with Notebooks, sharing and cloning Notebooks, and parallelization and scaling of scientific code requiring HPC and using Jupyter.

Cite

CITATION STYLE

APA

Henderson, M. L., Krinsman, W., Cholia, S., Thomas, R., & Slaton, T. (2020). Accelerating Experimental Science Using Jupyter and NERSC HPC. In Communications in Computer and Information Science (Vol. 1190 CCIS, pp. 145–163). Springer. https://doi.org/10.1007/978-3-030-44728-1_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free