Challenges in the Multivariate Analysis of Mass Cytometry Data: The Effect of Randomization

This article is free to access.

Abstract

Cytometry by time-of-flight (CyTOF) has emerged as a high-throughput single-cell technology able to provide large samples of protein readouts. A large pool of advanced high-dimensional analysis algorithms already exists that explore the observed heterogeneous distributions to make intriguing biological inferences. A fact largely overlooked by these methods, however, is the effect of the established data preprocessing pipeline on the distributions of the measured quantities. In this article, we focus on randomization, a transformation used for improving data visualization, which can negatively affect multivariate data analysis methods such as dimensionality reduction, clustering, and network reconstruction algorithms. Our results indicate that randomization should be used only for visualization purposes, not in conjunction with high-dimensional analytical tools. © 2019 The Authors. Cytometry Part A published by Wiley Periodicals, Inc. on behalf of International Society for Advancement of Cytometry.

Cite

Papoutsoglou, G., Lagani, V., Schmidt, A., Tsirlis, K., Cabrero, D. G., Tegnér, J., & Tsamardinos, I. (2019). Challenges in the Multivariate Analysis of Mass Cytometry Data: The Effect of Randomization. Cytometry Part A, 95(11), 1178–1190. https://doi.org/10.1002/cyto.a.23908
