Utility-Driven k-Anonymization of Public Transport User Data

Bhawani Shanker Bhati; Jordan Ivanchev; Iva Bojic; Anwitaman Datta; David Eckhoff

Journal ArticleOPEN ACCESS

Utility-Driven k-Anonymization of Public Transport User Data

IEEE Access (2021) 9 23608-23623

DOI: 10.1109/ACCESS.2021.3055505

6Citations

11Readers

Abstract

In this article, we propose a k -anonymity approach that prioritizes the generalization of attributes based on their utility. We focus on transport data, which we consider a special case in which many or all attributes are quasi-identifiers (e.g., origin, destination, ride start time), as they allow correlation with easily observable auxiliary data. The novelty in our approach lies in introducing normalization techniques as well as distance and utility metrics that allow the consideration of not only numerical attributes but also categorical attributes by representing them in tree or graph form. The prioritization of the attributes in the generalization process is based on the attributes' utility and can further be influenced by either automatically or manually assigned attribute weights. We evaluate and compare different options for all components of our mechanism as well as present an extensive performance evaluation of our approach using real-world data. Lastly, we show in which cases suppression of records can counter-intuitively lead to higher data utility.

Author supplied keywords

Cite

CITATION STYLE

APA

Bhati, B. S., Ivanchev, J., Bojic, I., Datta, A., & Eckhoff, D. (2021). Utility-Driven k-Anonymization of Public Transport User Data. IEEE Access, 9, 23608–23623. https://doi.org/10.1109/ACCESS.2021.3055505

Utility-Driven k-Anonymization of Public Transport User Data

Abstract

Author supplied keywords

Cite

Register to see more suggestions