Time-sequence data is high dimensional and contains a lot of information, which can be utilized in various fields, such as insurance, finance, and advertising. Personal data including time-sequence data is converted to anonymized datasets, which need to strike a balance between both privacy and utility. In this paper, we consider low-rank matrix factorization as one of anonymization methods and evaluate its efficiency. We convert time-sequence datasets to matrices and evaluate both privacy and utility. The record IDs in time-sequence data are changed at regular intervals to reduce re-identification risk. However, since individuals tend to behave in a similar fashion over periods of time, there remains a risk of record linkage even if record IDs are different. Hence, we evaluate the reidentification and linkage risks as privacy risks of time-sequence data. Our experimental results show that matrix factorization is a viable anonymization method and it can achieve better utility than existing anonymization methods.
CITATION STYLE
Mimoto, T., Hidano, S., Kiyomoto, S., & Miyaji, A. (2020). Anonymization technique based on SGD matrix factorization. IEICE Transactions on Information and Systems, E103D(2), 299–308. https://doi.org/10.1587/transinf.2019INP0013
Mendeley helps you to discover research relevant for your work.