This paper presents an investigation of differentially private analysis of distance-based outliers. Outlier detection aims to identify instances that are markedly distant from the other instances. Meanwhile, the objective of differential privacy is to conceal the presence (or absence) of any particular instance. Outlier detection and privacy protection are therefore intrinsically conflicting tasks. In this paper, we present differentially private queries for counting the outliers that appear in a given subspace, instead of reporting the detected outliers themselves. Our analysis of the global sensitivity of outlier counts reveals that standard global sensitivity-based methods can make the outputs too noisy, particularly when the dimensionality of the given subspace is high. Noting that the counts of outliers are typically expected to be small relative to the dataset size, we introduce a mechanism based on a smooth upper bound of the local sensitivity. This study is the first attempt to ensure differential privacy for distance-based outlier analysis. The experimentally obtained results show that our method achieves better utility than global sensitivity-based methods do.
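To make the trade-off concrete, the sketch below shows the standard Laplace mechanism for releasing a noisy count, with noise scaled to the query's sensitivity. This is a generic illustration, not the paper's smooth-sensitivity mechanism: the function name `laplace_mechanism` and its parameters are illustrative, and the key point it demonstrates is the paper's observation that a larger sensitivity forces proportionally larger noise, degrading utility.

```python
import numpy as np

def laplace_mechanism(true_count, sensitivity, epsilon, rng=None):
    """Release a noisy count satisfying epsilon-differential privacy.

    Noise is drawn from Laplace(0, sensitivity / epsilon), so a query
    with large sensitivity (as for outlier counts in high-dimensional
    subspaces) yields a proportionally noisier, less useful output.
    """
    rng = np.random.default_rng() if rng is None else rng
    scale = sensitivity / epsilon
    return true_count + rng.laplace(loc=0.0, scale=scale)

# Illustration: the same count released under two sensitivity bounds.
rng = np.random.default_rng(0)
low = [laplace_mechanism(10, 1.0, 1.0, rng) for _ in range(10_000)]
high = [laplace_mechanism(10, 10.0, 1.0, rng) for _ in range(10_000)]
```

Averaged over many draws, the absolute error of the high-sensitivity release is roughly ten times that of the low-sensitivity one, which is why a tighter (e.g., smooth local) sensitivity bound translates directly into better utility.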
Okada, R., Fukuchi, K., & Sakuma, J. (2015). Differentially private analysis of outliers. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9285, pp. 458–473). Springer-Verlag. https://doi.org/10.1007/978-3-319-23525-7_28