Data anonymization approach for incomplete microdata

6Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

To protect privacy against linking attacks, quasi-identifier attributes of microdata should be anonymized in privacy preserving data publishing. Although lots of algorithms have been proposed in this area, few of them can handle incomplete microdata. Most existing algorithms simply delete records with missing values, causing large information loss. This paper proposes a novel data anonymization approach called KAIM (k-anonymity for incomplete microdata), for incomplete microdata based on k-member algorithm and information entropy distance. Instead of deleting any records, KAIM effectively clusters records with similar characteristics together to minimize information loss, and then generalizes all records with local recording scheme. Results of extensive experiments base on real dataset show that KAIM causes only 43.8% information loss compared with previous algorithms for incomplete microdata, validating that KAIM performs much better than existing algorithms on the utility of anonymized dataset. © Copyright 2013, Institute of Software, the Chinese Academy of Sciences. All rights reserved.

Cite

CITATION STYLE

APA

Gong, Q. Y., Yang, M., & Luo, J. Z. (2013). Data anonymization approach for incomplete microdata. Ruan Jian Xue Bao/Journal of Software, 24(12), 2883–2896. https://doi.org/10.3724/SP.J.1001.2013.04411

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free