Sfaira accelerates data and model reuse in single cell genomics

20Citations
Citations of this article
85Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Single-cell RNA-seq datasets are often first analyzed independently without harnessing model fits from previous studies, and are then contextualized with public data sets, requiring time-consuming data wrangling. We address these issues with sfaira, a single-cell data zoo for public data sets paired with a model zoo for executable pre-trained models. The data zoo is designed to facilitate contribution of data sets using ontologies for metadata. We propose an adaption of cross-entropy loss for cell type classification tailored to datasets annotated at different levels of coarseness. We demonstrate the utility of sfaira by training models across anatomic data partitions on 8 million cells.

Cite

CITATION STYLE

APA

Fischer, D. S., Dony, L., König, M., Moeed, A., Zappia, L., Heumos, L., … Theis, F. J. (2021). Sfaira accelerates data and model reuse in single cell genomics. Genome Biology, 22(1). https://doi.org/10.1186/s13059-021-02452-6

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free