Differential privacy based on importance weighting

Abstract

This paper analyzes a novel method for publishing data while still protecting privacy. The method is based on computing weights that make an existing dataset, for which there are no confidentiality issues, analogous to the dataset that must be kept private. The existing dataset may be genuine but public already, or it may be synthetic. The weights are importance sampling weights, but to protect privacy, they are regularized and have noise added. The weights allow statistical queries to be answered approximately while provably guaranteeing differential privacy. We derive an expression for the asymptotic variance of the approximate answers. Experiments show that the new mechanism performs well even when the privacy budget is small, and when the public and private datasets are drawn from different populations.
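The core idea can be illustrated with a short sketch. The following Python snippet is a minimal, illustrative reconstruction, not the authors' exact mechanism: it estimates importance weights via the standard density-ratio trick (a regularized logistic regression distinguishing public from private records, whose parameters induce weights proportional to exp(theta . x)), perturbs the parameters with Laplace noise, and answers a statistical query as a weighted average over the public data. The function names, the regularization parameter lam, and the noise scale are assumptions for illustration; the paper derives a calibrated sensitivity bound rather than the placeholder used here.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def private_importance_weights(public_X, private_X, epsilon, lam=1.0, rng=None):
    """Compute differentially private importance weights for public_X.

    Illustrative sketch: labels public = 0, private = 1, fits a
    regularized logistic regression, and uses its coefficients as the
    parameters of an exponential-family density ratio.
    """
    rng = np.random.default_rng() if rng is None else rng
    X = np.vstack([public_X, private_X])
    y = np.concatenate([np.zeros(len(public_X)), np.ones(len(private_X))])
    # C = 1/lam: stronger regularization (larger lam) bounds the
    # parameters' sensitivity to any single private record.
    clf = LogisticRegression(C=1.0 / lam).fit(X, y)
    theta = clf.coef_.ravel()
    # Laplace noise on the parameters; this scale is a placeholder,
    # not the paper's calibrated sensitivity analysis.
    noise_scale = 2.0 / (lam * epsilon * len(private_X))
    theta_noisy = theta + rng.laplace(scale=noise_scale, size=theta.shape)
    # Weights reweight the public sample to mimic the private one.
    return np.exp(public_X @ theta_noisy)

def weighted_query(public_X, weights, f):
    """Approximate E_private[f(x)] by a weighted average over public data.

    Privacy is inherited from the noisy weights, so f itself needs no
    further perturbation.
    """
    vals = np.apply_along_axis(f, 1, public_X)
    return np.sum(weights * vals) / np.sum(weights)

# Example (hypothetical data): estimate the private mean of feature 0.
#   w = private_importance_weights(pub, priv, epsilon=0.5)
#   est = weighted_query(pub, w, lambda x: x[0])
```

Because the noise is added once to the weight parameters, arbitrarily many downstream queries can reuse the same weights without spending additional privacy budget, which is what makes the importance-weighting approach attractive when the budget is small.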

Citation (APA)

Ji, Z., & Elkan, C. (2013). Differential privacy based on importance weighting. Machine Learning, 93(1), 163–183. https://doi.org/10.1007/s10994-013-5396-x
