Reproducibility Starts at the Source: R, Python, and Julia Packages for Retrieving USGS Hydrologic Data

1Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

Much of modern science takes place in a computational environment, and, increasingly, that environment is programmed using R, Python, or Julia. Furthermore, most scientific data now live on the cloud, so the first step in many workflows is to query a cloud database and load the response into a computational environment for further analysis. Thus, tools that facilitate programmatic data retrieval represent a critical component in reproducible scientific workflows. Earth science is no different in this regard. To fulfill that basic need, we developed R, Python, and Julia packages providing programmatic access to the U.S. Geological Survey’s National Water Information System database and the multi-agency Water Quality Portal. Together, these packages create a common interface for retrieving hydrologic data in the Jupyter ecosystem, which is widely used in water research, operations, and teaching. Source code, documentation, and tutorials for the packages are available on GitHub. Users can go there to learn, raise issues, or contribute improvements within a single platform, which helps foster better engagement and collaboration between data providers and their users.

Cite

CITATION STYLE

APA

Hodson, T. O., DeCicco, L. A., Hariharan, J. A., Stanish, L. F., Black, S., & Horsburgh, J. S. (2023). Reproducibility Starts at the Source: R, Python, and Julia Packages for Retrieving USGS Hydrologic Data. Water (Switzerland), 15(24). https://doi.org/10.3390/w15244236

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free