Abstract
Single-cell flow and mass cytometry technologies are being increasingly applied in clinical settings, as they enable the simultaneous measurement of multiple proteins across millions of cells within a multi-patient cohort. In this work, we introduce CytoSet, a deep learning model that can directly predict a patient's clinical outcome from a collection of cells obtained through a blood or tissue sample. Unlike previous work, CytoSet explicitly models the cells profiled in each patient sample as a set, allowing for the use of recently developed permutation invariant architectures. We show that CytoSet achieves state-of-the-art classification performance across a variety of flow and mass cytometry benchmark datasets. The strong classification performance is further complemented by demonstrated robustness to the number of sub-sampled cells per patient and the depth of model, enabling CytoSet to scale adequately to hundreds of patient samples. The strong performance achieved by the set-based architectures used in CytoSet suggests that clinical cytometry data can be appropriately interpreted and studied as sets. The code is publicly available at https://github.com/CompCy-lab/cytoset.
Author supplied keywords
Cite
CITATION STYLE
Yi, H., & Stanley, N. (2021). CytoSet: Predicting clinical outcomes via set-modeling of cytometry data. In Proceedings of the 12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2021. Association for Computing Machinery, Inc. https://doi.org/10.1145/3459930.3469529
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.