Non-metric multidimensional scaling for privacy-preserving data clustering

Khaled Alotaibi; Victor J. Rayward-Smith; Beatriz De La Iglesia

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 6936 LNCS, pp. 287–298 (2011)

DOI: 10.1007/978-3-642-23878-9_35

Abstract

Outsourcing data to external parties for analysis is risky as the privacy of confidential variables can be easily violated. To eliminate this threat, the data values of these variables should be perturbed before releasing the data. However, the perturbation itself may significantly change the underlying properties of the data, affecting the analysis results. What is required is a subtle transformation to generate perturbed data that maintains, as much as possible, the statistical properties and effectiveness (i.e. the utility) of the original data whilst preserving the privacy. We examine privacy-preserving transformations in the context of data clustering. In particular, this paper demonstrates how non-metric multidimensional scaling (MDS) can be profitably used as a perturbation tool and how the perturbed data can be effectively used in clustering analysis without compromising privacy or utility. We apply the proposed technique to real datasets and compare the results, which were, in some circumstances, exactly the same as those obtained from the original data. © 2011 Springer-Verlag.
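For readers unfamiliar with non-metric MDS, the following is a sketch of the objective it conventionally minimises, Kruskal's Stress-1. The abstract does not say which stress formulation the authors use, so the formula and the symbols below are assumptions based on the standard definition, not a statement of the paper's method.

```latex
% Kruskal's Stress-1 (assumed objective; not stated in the abstract)
% \delta_{ij} : original dissimilarity between records i and j
% d_{ij}      : Euclidean distance between their perturbed (embedded) points
% \hat{d}_{ij}: disparities, from monotone regression of d_{ij} on \delta_{ij}
\[
  \text{Stress-1}
  = \sqrt{\frac{\sum_{i<j} \bigl(d_{ij} - \hat{d}_{ij}\bigr)^{2}}
               {\sum_{i<j} d_{ij}^{2}}},
  \qquad
  \delta_{ij} < \delta_{kl} \;\Rightarrow\; \hat{d}_{ij} \le \hat{d}_{kl}.
\]
```

Under this definition, a stress of zero means the distances between perturbed points are monotonically related to the original dissimilarities, even though the coordinate values themselves differ from the original data.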

Citation (APA)

Alotaibi, K., Rayward-Smith, V. J., & De La Iglesia, B. (2011). Non-metric multidimensional scaling for privacy-preserving data clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6936 LNCS, pp. 287–298). https://doi.org/10.1007/978-3-642-23878-9_35
