Anonymization technique based on SGD matrix factorization

1Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

Time-sequence data is high dimensional and contains a lot of information, which can be utilized in various fields, such as insurance, finance, and advertising. Personal data including time-sequence data is converted to anonymized datasets, which need to strike a balance between both privacy and utility. In this paper, we consider low-rank matrix factorization as one of anonymization methods and evaluate its efficiency. We convert time-sequence datasets to matrices and evaluate both privacy and utility. The record IDs in time-sequence data are changed at regular intervals to reduce re-identification risk. However, since individuals tend to behave in a similar fashion over periods of time, there remains a risk of record linkage even if record IDs are different. Hence, we evaluate the reidentification and linkage risks as privacy risks of time-sequence data. Our experimental results show that matrix factorization is a viable anonymization method and it can achieve better utility than existing anonymization methods.

Cite

CITATION STYLE

APA

Mimoto, T., Hidano, S., Kiyomoto, S., & Miyaji, A. (2020). Anonymization technique based on SGD matrix factorization. IEICE Transactions on Information and Systems, E103D(2), 299–308. https://doi.org/10.1587/transinf.2019INP0013

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free