Machine learning with membership privacy using adversarial regularization

Abstract

Machine learning models leak a significant amount of information about their training sets through their predictions. This is a serious privacy concern for users of machine learning as a service. To address this concern, in this paper we focus on mitigating the risks of black-box inference attacks against machine learning models. We introduce a mechanism to train models with membership privacy, which ensures indistinguishability between the predictions of a model on its training data and on other data points (from the same distribution). This requires minimizing the accuracy of the best black-box membership inference attack against the model. We formalize this as a min-max game and design an adversarial training algorithm that minimizes the prediction loss of the model as well as the maximum gain of the inference attacks. This strategy, which can guarantee membership privacy (as prediction indistinguishability), also acts as a strong regularizer and helps the model generalize. We evaluate the practical feasibility of our privacy mechanism on training deep neural networks using benchmark datasets. We show that the min-max strategy can mitigate the risks of membership inference attacks (near random guess), and can achieve this with a negligible drop in the model’s prediction accuracy (less than 4%).
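To illustrate the min-max idea described in the abstract, the following is a minimal toy sketch (not the paper's implementation): a logistic-regression classifier is trained jointly against a logistic-regression "attack" model that tries to tell member predictions apart from non-member predictions, alternating a few attack-maximization steps with one classifier step that minimizes classification loss plus a weighted attack-gain penalty on members. All names, the synthetic Gaussian data, and the scalar attack feature (the classifier's confidence) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Toy data: two Gaussian classes. X/y is the member (training) set;
# Xr is a reference set of non-members from the same distribution.
def make_data(n):
    X0 = rng.normal(-1.0, 1.0, size=(n // 2, 2))
    X1 = rng.normal(+1.0, 1.0, size=(n // 2, 2))
    X = np.vstack([X0, X1])
    y = np.array([0] * (n // 2) + [1] * (n // 2), dtype=float)
    return X, y

X, y = make_data(200)    # members
Xr, _ = make_data(200)   # non-members

w, b = np.zeros(2), 0.0  # classifier parameters
a, c = 0.0, 0.0          # attack-model parameters (reads the confidence)
lam, lr = 1.0, 0.1       # privacy weight lambda and learning rate

for step in range(500):
    p = sigmoid(X @ w + b)    # classifier confidence on members
    pr = sigmoid(Xr @ w + b)  # ... and on non-members

    # Inner maximization: attack model maximizes its membership gain
    # G = E[log m] over members + E[log(1 - m)] over non-members.
    for _ in range(5):
        m = sigmoid(a * p + c)
        mr = sigmoid(a * pr + c)
        ga = np.mean((1 - m) * p) - np.mean(mr * pr)  # dG/da
        gc = np.mean(1 - m) - np.mean(mr)             # dG/dc
        a += lr * ga
        c += lr * gc

    # Outer minimization: cross-entropy loss + lam * attack gain on members.
    m = sigmoid(a * p + c)
    gw = X.T @ (p - y) / len(y)                       # cross-entropy grad
    gb = np.mean(p - y)
    # grad of lam * mean(log m) through p = sigmoid(x.w + b)
    gw += lam * X.T @ ((1 - m) * a * p * (1 - p)) / len(y)
    gb += lam * np.mean((1 - m) * a * p * (1 - p))
    w -= lr * gw
    b -= lr * gb

# Evaluate: classifier accuracy, and attack accuracy on members vs non-members.
p = sigmoid(X @ w + b)
pr = sigmoid(Xr @ w + b)
clf_acc = np.mean((p > 0.5) == (y == 1))
scores = sigmoid(a * np.concatenate([p, pr]) + c)
labels = np.array([1] * len(p) + [0] * len(pr))
atk_acc = np.mean((scores > 0.5) == (labels == 1))
print(f"classifier acc {clf_acc:.2f}, attack acc {atk_acc:.2f}")
```

On this toy problem the trained attack ends up near random guessing while the classifier retains good accuracy, mirroring the trade-off the abstract reports; the deep-network version in the paper uses the same alternating min-max structure with richer attack features.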

Citation (APA)

Nasr, M., Shokri, R., & Houmansadr, A. (2018). Machine learning with membership privacy using adversarial regularization. In Proceedings of the ACM Conference on Computer and Communications Security (pp. 634–646). Association for Computing Machinery. https://doi.org/10.1145/3243734.3243855
