Nebula: A Scalable Privacy-Preserving Machine Learning System in Ant Financial

Cen Chen; Bingzhe Wu; Li Wang; Chaochao Chen; Jin Tan; Lei Wang; Jun Zhou; Benyu Zhang

Conference ProceedingsOPEN ACCESS

Nebula: A Scalable Privacy-Preserving Machine Learning System in Ant Financial

International Conference on Information and Knowledge Management, Proceedings (2020) 3369-3372

DOI: 10.1145/3340531.3417418

4Citations

25Readers

Get full text

Abstract

With the rapid growth of data volume, data-driven machine learning models have become a necessary part of many industrial applications. Intuitively, the more high-quality data used for training leads to better model performance. However, in reality, data are usually scattered and isolated in different organizations or companies. Such a "data isolation" problem stimulates both academia and industry to explore the collaborative learning paradigm to build better models jointly with multiple data sources. Despite the potential performance gains, this learning paradigm inevitably faces privacy issues, especially for the Fintech domain where data are sensitive by nature. In this paper, we present a privacy-preserving collaborative learning system in Ant Financial, named Nebula. Our system aims to facilitate privacy-preserving collaborative model training for industrial-scale applications. Our system is built upon a ring-allreduce MPI based distributed framework. On top of that, with some optimization strategies and novel sharing scheme, our system is able to scale up to tens of millions of data samples with hundreds of thousands of features and achieve more than 100x speedup compared with the existing state-of-the-art implementations.

Author supplied keywords

Cite

CITATION STYLE

APA

Chen, C., Wu, B., Wang, L., Chen, C., Tan, J., Wang, L., … Zhang, B. (2020). Nebula: A Scalable Privacy-Preserving Machine Learning System in Ant Financial. In International Conference on Information and Knowledge Management, Proceedings (pp. 3369–3372). Association for Computing Machinery. https://doi.org/10.1145/3340531.3417418

Nebula: A Scalable Privacy-Preserving Machine Learning System in Ant Financial

Abstract

Author supplied keywords

Cite

Register to see more suggestions