SCIM: Universal single-cell matching with unpaired feature sets

35Citations
Citations of this article
76Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: Recent technological advances have led to an increase in the production and availability of single-cell data. The ability to integrate a set of multi-technology measurements would allow the identification of biologically or clinically meaningful observations through the unification of the perspectives afforded by each technology. In most cases, however, profiling technologies consume the used cells and thus pairwise correspondences between datasets are lost. Due to the sheer size single-cell datasets can acquire, scalable algorithms that are able to universally match single-cell measurements carried out in one cell to its corresponding sibling in another technology are needed. Results: We propose Single-Cell data Integration via Matching (SCIM), a scalable approach to recover such correspondences in two or more technologies. SCIM assumes that cells share a common (low-dimensional) underlying structure and that the underlying cell distribution is approximately constant across technologies. It constructs a technology-invariant latent space using an autoencoder framework with an adversarial objective. Multi-modal datasets are integrated by pairing cells across technologies using a bipartite matching scheme that operates on the low-dimensional latent representations. We evaluate SCIM on a simulated cellular branching process and show that the cell-to-cell matches derived by SCIM reflect the same pseudotime on the simulated dataset. Moreover, we apply our method to two real-world scenarios, a melanoma tumor sample and a human bone marrow sample, where we pair cells from a scRNA dataset to their sibling cells in a CyTOF dataset achieving 90% and 78% cell-matching accuracy for each one of the samples, respectively.

References Powered by Scopus

Comprehensive Integration of Single-Cell Data

8238Citations
N/AReaders
Get full text

Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq

2981Citations
N/AReaders
Get full text

mRNA-Seq whole-transcriptome analysis of a single cell

2673Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Multi-omics single-cell data integration and regulatory inference with graph-linked embedding

222Citations
N/AReaders
Get full text

Computational principles and challenges in single-cell data integration

208Citations
N/AReaders
Get full text

Inferring and perturbing cell fate regulomes in human brain organoids

102Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Stark, S. G., Ficek, J., Locatello, F., Bonilla, X., Chevrier, S., Singer, F., … Gregor, Z. (2020). SCIM: Universal single-cell matching with unpaired feature sets. Bioinformatics, 36, I919–I927. https://doi.org/10.1093/bioinformatics/btaa843

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 26

58%

Researcher 14

31%

Professor / Associate Prof. 4

9%

Lecturer / Post doc 1

2%

Readers' Discipline

Tooltip

Computer Science 15

38%

Biochemistry, Genetics and Molecular Bi... 12

31%

Agricultural and Biological Sciences 10

26%

Physics and Astronomy 2

5%

Save time finding and organizing research with Mendeley

Sign up for free