Privacy-preserving constrained spectral clustering algorithm for large-scale data sets

8Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

With the increasing concern on the preservation of personal privacy, privacy-preserving data mining has become a hot topic in recent years. Spectral clustering is one of the most widely used clustering algorithm for exploratory data analysis and usually has to deal with sensitive data sets. How to conduct privacy-preserving spectral clustering is an urgent problem to be solved. In this study, the authors focus on introducing the notion of differential privacy, which is considered as the de facto standard of privacy-preserving data analysis, into spectral clustering. Specifically, by combining the well-studied constrained spectral clustering with the Wishart mechanism in a novel way, the authors propose a differentially private constrained spectral clustering (DP-CSC) algorithm. The DP-CSC algorithm is proved to capture asymptotic property and achieves ?-differential privacy. To illustrate the effectiveness and efficiency of DP-CSC, the authors conduct experiments on five real-word data sets. The results indicate that the DP-CSC algorithm can provide acceptable clustering accuracy with short running time while preserving individual privacy.

Cite

CITATION STYLE

APA

Li, J., Wei, J., Ye, M., Liu, W., & Hu, X. (2020). Privacy-preserving constrained spectral clustering algorithm for large-scale data sets. IET Information Security, 14(3), 321–331. https://doi.org/10.1049/iet-ifs.2019.0255

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free