Abstract
The advent of single-cell RNA sequencing (scRNA-seq) has revolutionized our understanding of cellular heterogeneity and complexity in biological tissues. However, the nature of large, sparse scRNA-seq datasets and privacy regulations present challenges for efficient cell identification. Federated learning provides a solution, allowing efficient and private data use. Here, we introduce scFed, a unified federated learning framework that allows for benchmarking of four classification algorithms without violating data privacy, including single-cell-specific and general-purpose classifiers. We evaluated scFed using eight publicly available scRNA-seq datasets with diverse sizes, species and technologies, assessing its performance via intra-dataset and inter-dataset experimental setups. We find that scFed performs well on a variety of datasets with competitive accuracy to centralized models. Though Transformer-based model excels in centralized training, its performance slightly lags behind single-cell-specific model within the scFed framework, coupled with a notable time complexity concern. Our study not only helps select suitable cell identification methods but also highlights federated learning's potential for privacy-preserving, collaborative biomedical research.
Author supplied keywords
Cite
CITATION STYLE
Wang, S., Shen, B., Guo, L., Shang, M., Liu, J., Sun, Q., & Shen, B. (2024). ScFed: Federated learning for cell type classification with scRNA-seq. Briefings in Bioinformatics, 25(1). https://doi.org/10.1093/bib/bbad507
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.