PeacoQC: Peak-based selection of high quality cytometry data

36Citations
Citations of this article
76Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

In cytometry analysis, a large number of markers is measured for thousands or millions of cells, resulting in high-dimensional datasets. During the measurement of these samples, erroneous events can occur such as clogs, speed changes, slow uptake of the sample etc., which can influence the downstream analysis and can even lead to false discoveries. As these issues can be difficult to detect manually, an automated approach is recommended. In order to filter these erroneous events out, we created a novel quality control algorithm, Peak Extraction And Cleaning Oriented Quality Control (PeacoQC), that allows for automated cleaning of cytometry data. The algorithm will determine density peaks per channel on which it will remove low quality events based on their position in the isolation tree and on their mean absolute deviation distance to these density peaks. To evaluate PeacoQC's cleaning capability, it was compared to three other existing quality control algorithms (flowAI, flowClean and flowCut) on a wide variety of datasets. In comparison to the other algorithms, PeacoQC was able to filter out all different types of anomalies in flow, mass and spectral cytometry data, while the other methods struggled with at least one type. In the quantitative comparison, PeacoQC obtained the highest median balanced accuracy and a similar running time compared to the other algorithms while having a better scalability for large files. To ensure that the parameters chosen in the PeacoQC algorithm are robust, the cleaning tool was run on 16 public datasets. After inspection, only one sample was found where the parameters should be further optimized. The other 15 datasets were analyzed correctly indicating a robust parameter choice. Overall, we present a fast and accurate quality control algorithm that outperforms existing tools and ensures high-quality data that can be used for further downstream analysis. An R implementation is available.

References Powered by Scopus

Isolation forest

4723Citations
N/AReaders
Get full text

FlowSOM: Using self-organizing maps for visualization and interpretation of cytometry data

1196Citations
N/AReaders
Get full text

EuroFlow standardization of flow cytometer instrument settings and immunophenotyping protocols

619Citations
N/AReaders
Get full text

Cited by Powered by Scopus

LXR signaling controls homeostatic dendritic cell maturation

31Citations
N/AReaders
Get full text

How to Prepare Spectral Flow Cytometry Datasets for High Dimensional Data Analysis: A Practical Workflow

24Citations
N/AReaders
Get full text

Challenges in translational machine learning

15Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Emmaneel, A., Quintelier, K., Sichien, D., Rybakowska, P., Marañón, C., Alarcón-Riquelme, M. E., … Saeys, Y. (2022). PeacoQC: Peak-based selection of high quality cytometry data. Cytometry Part A, 101(4), 325–338. https://doi.org/10.1002/cyto.a.24501

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 18

56%

Researcher 10

31%

Professor / Associate Prof. 2

6%

Lecturer / Post doc 2

6%

Readers' Discipline

Tooltip

Immunology and Microbiology 16

38%

Biochemistry, Genetics and Molecular Bi... 15

36%

Pharmacology, Toxicology and Pharmaceut... 6

14%

Medicine and Dentistry 5

12%

Save time finding and organizing research with Mendeley

Sign up for free