Pysradb: A Python package to query next-generation sequencing metadata and data from NCBI sequence read archive

27Citations
Citations of this article
62Readers
Mendeley users who have this article in their library.

Abstract

The NCBI Sequence Read Archive (SRA) is the primary archive of next-generation sequencing datasets. SRA makes metadata and raw sequencing data available to the research community to encourage reproducibility and to provide avenues for testing novel hypotheses on publicly available data. However, methods to programmatically access this data are limited. We introduce the Python package, pysradb, which provides a collection of command line methods to query and download metadata and data from SRA, utilizing the curated metadata database available through the SRAdb project. We demonstrate the utility of pysradb on multiple use cases for searching and downloading SRA datasets. It is available freely at https://github.com/saketkc/pysradb.

Author supplied keywords

Cite

CITATION STYLE

APA

Choudhary, S. (2019). Pysradb: A Python package to query next-generation sequencing metadata and data from NCBI sequence read archive. F1000Research, 8. https://doi.org/10.12688/f1000research.18676.1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free