Abstract
With the increasing concern on the preservation of personal privacy, privacy-preserving data mining has become a hot topic in recent years. Spectral clustering is one of the most widely used clustering algorithm for exploratory data analysis and usually has to deal with sensitive data sets. How to conduct privacy-preserving spectral clustering is an urgent problem to be solved. In this study, the authors focus on introducing the notion of differential privacy, which is considered as the de facto standard of privacy-preserving data analysis, into spectral clustering. Specifically, by combining the well-studied constrained spectral clustering with the Wishart mechanism in a novel way, the authors propose a differentially private constrained spectral clustering (DP-CSC) algorithm. The DP-CSC algorithm is proved to capture asymptotic property and achieves ?-differential privacy. To illustrate the effectiveness and efficiency of DP-CSC, the authors conduct experiments on five real-word data sets. The results indicate that the DP-CSC algorithm can provide acceptable clustering accuracy with short running time while preserving individual privacy.
Cite
CITATION STYLE
Li, J., Wei, J., Ye, M., Liu, W., & Hu, X. (2020). Privacy-preserving constrained spectral clustering algorithm for large-scale data sets. IET Information Security, 14(3), 321–331. https://doi.org/10.1049/iet-ifs.2019.0255
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.