PPMark: An architecture to generate privacy labels using TF-IDF techniques and the rabin karp algorithm

Diego Roberto Gonçalves de Pontes; Sergio Donizetti Zorzo

Conference Proceedings

PPMark: An architecture to generate privacy labels using TF-IDF techniques and the rabin karp algorithm

Advances in Intelligent Systems and Computing (2016) 448 1029-1040

DOI: 10.1007/978-3-319-32467-8_89

2Citations

8Readers

Get full text

Abstract

Layman and non-layman users often have difficulties to understand privacy policy texts. The amount of time spent on reading and comprehending a policy poses a challenge to the user, who rarely pays attention to what he or she is agreeing to. Given this scenario, this paper aims to facilitate privacy policy terms presentation regarding data collection and sharing by introducing a new format called Privacy Label. Using natural language processing techniques, a model able to extract information about data collection in privacy policies and present them in an automated and easy-to-understand way to the user was built. To validate this model we used a precision assessment method where the accuracy of the extracted information was measured. The precision of our modelwas 0.685 (69%)when recovering information regarding data handling, making it possible for the final user to understand which data is being collected without reading the whole policy. The PPMark architecture can facilitate the notice-and-choice by presenting privacy policy information in an alternative way for online users.

Cite

CITATION STYLE

APA

Gonçalves de Pontes, D. R., & Zorzo, S. D. (2016). PPMark: An architecture to generate privacy labels using TF-IDF techniques and the rabin karp algorithm. In Advances in Intelligent Systems and Computing (Vol. 448, pp. 1029–1040). Springer Verlag. https://doi.org/10.1007/978-3-319-32467-8_89

PPMark: An architecture to generate privacy labels using TF-IDF techniques and the rabin karp algorithm

Abstract

Cite

Register to see more suggestions