Balancing quality and confidentiality for multivariate tabular data

Abstract

Absolute cell deviation has been used as a proxy for preserving data quality in statistical disclosure limitation for tabular data. However, users' primary interest is that the analytical properties of the data are largely preserved, meaning that the values of key statistics are nearly unchanged. Moreover, important relationships within the published tables (additivity) and between them (correlation) should also be unaffected. Previous work demonstrated how to preserve additivity, mean, and variance for univariate tabular data. In this paper, we bridge the gap between statistics and mathematical programming to propose nonlinear and linear models, based on constraint satisfaction, that preserve additivity as well as the covariance, correlation, and regression coefficient between data tables. Linear models are superior to nonlinear models owing to their simplicity, flexibility, and computational speed. Simulations demonstrate that the models perform well, preserving key statistics with reasonable accuracy. © Springer-Verlag 2004.
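
To make the quantities named in the abstract concrete, the following is a minimal sketch of how one might check whether an adjusted table keeps additivity and how far the covariance, correlation, and regression coefficient between two tables drift after adjustment. It is not the authors' optimization model: the function names, the small 2x2 tables, and the use of the population form of covariance (dividing by n) are all assumptions made for illustration.

```python
from math import sqrt

def mean(xs):
    return sum(xs) / len(xs)

def covariance(xs, ys):
    # Population covariance (divides by n); an assumption, not taken from the paper.
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)

def correlation(xs, ys):
    return covariance(xs, ys) / sqrt(covariance(xs, xs) * covariance(ys, ys))

def regression_slope(xs, ys):
    # Slope of the least-squares regression of ys on xs.
    return covariance(xs, ys) / covariance(xs, xs)

def is_additive(table, row_totals, col_totals, tol=1e-9):
    # Additivity: interior cells must still sum to the published marginal totals.
    rows_ok = all(abs(sum(r) - t) <= tol for r, t in zip(table, row_totals))
    cols_ok = all(abs(sum(c) - t) <= tol for c, t in zip(zip(*table), col_totals))
    return rows_ok and cols_ok

def flatten(table):
    return [v for row in table for v in row]

def key_statistics(table_x, table_y):
    # Cell-wise statistics relating two tables that share the same layout.
    xs, ys = flatten(table_x), flatten(table_y)
    return {
        "covariance": covariance(xs, ys),
        "correlation": correlation(xs, ys),
        "regression_slope": regression_slope(xs, ys),
    }

# Hypothetical example data: two tables with the same layout.
x_original = [[10, 20], [30, 40]]
y_original = [[12, 18], [33, 41]]
# Hypothetical adjusted version of x; the +1/-1 pattern keeps all marginals.
x_adjusted = [[11, 19], [29, 41]]

row_totals = [sum(r) for r in x_original]
col_totals = [sum(c) for c in zip(*x_original)]

before = key_statistics(x_original, y_original)
after = key_statistics(x_adjusted, y_original)
drift = {name: after[name] - before[name] for name in before}

print("additive:", is_additive(x_adjusted, row_totals, col_totals))
print("drift in key statistics:", drift)
```

In this sketch the adjustment is fixed by hand and the statistics are only measured afterwards. The approach described in the abstract instead builds such requirements into a mathematical program as constraints, so that the adjusted values are chosen to keep the drift small.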

Cox, L. H., Kelly, J. P., & Patil, R. (2004). Balancing quality and confidentiality for multivariate tabular data. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3050, 87–98. https://doi.org/10.1007/978-3-540-25955-8_7
