PPMark: An architecture to generate privacy labels using TF-IDF techniques and the rabin karp algorithm

2Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Layman and non-layman users often have difficulties to understand privacy policy texts. The amount of time spent on reading and comprehending a policy poses a challenge to the user, who rarely pays attention to what he or she is agreeing to. Given this scenario, this paper aims to facilitate privacy policy terms presentation regarding data collection and sharing by introducing a new format called Privacy Label. Using natural language processing techniques, a model able to extract information about data collection in privacy policies and present them in an automated and easy-to-understand way to the user was built. To validate this model we used a precision assessment method where the accuracy of the extracted information was measured. The precision of our modelwas 0.685 (69%)when recovering information regarding data handling, making it possible for the final user to understand which data is being collected without reading the whole policy. The PPMark architecture can facilitate the notice-and-choice by presenting privacy policy information in an alternative way for online users.

Cite

CITATION STYLE

APA

Gonçalves de Pontes, D. R., & Zorzo, S. D. (2016). PPMark: An architecture to generate privacy labels using TF-IDF techniques and the rabin karp algorithm. In Advances in Intelligent Systems and Computing (Vol. 448, pp. 1029–1040). Springer Verlag. https://doi.org/10.1007/978-3-319-32467-8_89

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free