Polled Digital Cell Sorter (p-DCS): Automatic identification of hematological cell types from single cell RNA-sequencing clusters

16Citations
Citations of this article
33Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: Single cell RNA sequencing (scRNA-seq) brings unprecedented opportunities for mapping the heterogeneity of complex cellular environments such as bone marrow, and provides insight into many cellular processes. Single cell RNA-seq has a far larger fraction of missing data reported as zeros (dropouts) than traditional bulk RNA-seq, and unsupervised clustering combined with Principal Component Analysis (PCA) can be used to overcome this limitation. After clustering, however, one has to interpret the average expression of markers on each cluster to identify the corresponding cell types, and this is normally done by hand by an expert curator. Results: We present a computational tool for processing single cell RNA-seq data that uses a voting algorithm to automatically identify cells based on approval votes received by known molecular markers. Using a stochastic procedure that accounts for imbalances in the number of known molecular signatures for different cell types, the method computes the statistical significance of the final approval score and automatically assigns a cell type to clusters without an expert curator. We demonstrate the utility of the tool in the analysis of eight samples of bone marrow from the Human Cell Atlas. The tool provides a systematic identification of cell types in bone marrow based on a list of markers of immune cell types, and incorporates a suite of visualization tools that can be overlaid on a t-SNE representation. The software is freely available as a Python package at https://github.com/sdomanskyi/DigitalCellSorter. Conclusions: This methodology assures that extensive marker to cell type matching information is taken into account in a systematic way when assigning cell clusters to cell types. Moreover, the method allows for a high throughput processing of multiple scRNA-seq datasets, since it does not involve an expert curator, and it can be applied recursively to obtain cell sub-types. The software is designed to allow the user to substitute the marker to cell type matching information and apply the methodology to different cellular environments.

References Powered by Scopus

Robust enumeration of cell subsets from tissue expression profiles

8453Citations
N/AReaders
Get full text

Objective criteria for the evaluation of clustering methods

4938Citations
N/AReaders
Get full text

Massively parallel digital transcriptional profiling of single cells

3961Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Automatic cell type identification methods for single-cell RNA sequencing

34Citations
N/AReaders
Get full text

Impact of data preprocessing on cell-type clustering based on single-cell RNA-seq data

14Citations
N/AReaders
Get full text

scAnnotatR: framework to accurately classify cell types in single-cell RNA-sequencing data

13Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Domanskyi, S., Szedlak, A., Hawkins, N. T., Wang, J., Paternostro, G., & Piermarocchi, C. (2019). Polled Digital Cell Sorter (p-DCS): Automatic identification of hematological cell types from single cell RNA-sequencing clusters. BMC Bioinformatics, 20(1). https://doi.org/10.1186/s12859-019-2951-x

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 13

62%

Researcher 8

38%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 11

58%

Agricultural and Biological Sciences 4

21%

Computer Science 2

11%

Neuroscience 2

11%

Article Metrics

Tooltip
Mentions
Blog Mentions: 1

Save time finding and organizing research with Mendeley

Sign up for free