The data produced by differential privacy histogram publishing algorithm based on grouping has low usability due to large approximation error and Laplace error. To solve this problem, a histogram publishing algorithm based on roulette sampling sort and greedy partition is proposed. Our algorithm combines the exponential mechanism with the roulette sampling sorting method, arranges the similar histogram bins together with a larger probability by the utility function and the restriction on the number of sampled entity. The greedy clustering algorithm is used to partition the sorted histogram bins into groups, and the error among histogram bins in each group is reduced by optimizing the lower bound error of the grouping. Extensive experimental results show that the proposed algorithm can effectively improve the usability of published data under the premise of satisfying differential privacy.
CITATION STYLE
Wu, X., Tong, N., Ye, Z., & Wang, Y. (2020). Histogram Publishing Algorithm Based on Sampling Sorting and Greedy Clustering. In Communications in Computer and Information Science (Vol. 1156 CCIS, pp. 81–91). Springer. https://doi.org/10.1007/978-981-15-2777-7_7
Mendeley helps you to discover research relevant for your work.