Abstract
Researchers view vast zeros in single-cell RNA-seq data differently: some regard zeros as biological signals representing no or low gene expression, while others regard zeros as missing data to be corrected. To help address the controversy, here we discuss the sources of biological and non-biological zeros; introduce five mechanisms of adding non-biological zeros in computational benchmarking; evaluate the impacts of non-biological zeros on data analysis; benchmark three input data types: observed counts, imputed counts, and binarized counts; discuss the open questions regarding non-biological zeros; and advocate the importance of transparent analysis.
Cite
CITATION STYLE
Jiang, R., Sun, T., Song, D., & Li, J. J. (2022, December 1). Statistics or biology: the zero-inflation controversy about scRNA-seq data. Genome Biology. BioMed Central Ltd. https://doi.org/10.1186/s13059-022-02601-5
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.