The protection of personally identifiable information (PII) is increasingly demanded by customers and data protection regulation. To safeguard PII, an organization has to find out which incoming communication actually contains it. Only then can PII be labeled, tracked, and protected. E-mails are one of the main means of communication, yet they consist of unstructured data that is difficult to classify. We developed an automated detection system for PII in e-mails and connected it to a usage control infrastructure. Our concept is based on previous findings in the area of spam detection. We tested our approach with a data set in a customer service scenario. The evaluation shows that Bayes classification is very promising for detecting PII.
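To illustrate the kind of Bayes classification used in spam filtering and applied here to PII, the following is a minimal sketch of a multinomial naive Bayes text classifier with Laplace smoothing. It is not the authors' implementation: the abstract does not describe their features, preprocessing, or training data, so the class name `NaiveBayes`, the labels `pii` and `no_pii`, the tokenizer, and the toy training messages are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


class NaiveBayes:
    """Multinomial naive Bayes with add-one (Laplace) smoothing."""

    def __init__(self):
        self.word_counts = {}        # label -> Counter of token frequencies
        self.doc_counts = Counter()  # label -> number of training messages
        self.vocab = set()           # all tokens seen during training

    def train(self, text, label):
        tokens = tokenize(text)
        self.word_counts.setdefault(label, Counter()).update(tokens)
        self.doc_counts[label] += 1
        self.vocab.update(tokens)

    def log_scores(self, text):
        """Return the unnormalized log posterior for each label."""
        tokens = tokenize(text)
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for label, counts in self.word_counts.items():
            # Log prior: share of training messages carrying this label.
            score = math.log(self.doc_counts[label] / total_docs)
            # Smoothed log likelihood of each token given the label.
            denom = sum(counts.values()) + len(self.vocab)
            for token in tokens:
                score += math.log((counts[token] + 1) / denom)
            scores[label] = score
        return scores

    def classify(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Invented toy messages (hypothetical, not the paper's data set).
TRAINING = [
    ("My name is Anna Schmidt and my account number is 12345", "pii"),
    ("Please update my address to 12 Main Street and my phone number", "pii"),
    ("My date of birth is 1 May 1980 and my customer number is 998", "pii"),
    ("What are your opening hours on Saturday", "no_pii"),
    ("Is the new product available in blue", "no_pii"),
    ("Thank you for the quick reply regarding the delivery times", "no_pii"),
]

clf = NaiveBayes()
for message, label in TRAINING:
    clf.train(message, label)

if __name__ == "__main__":
    print(clf.classify("Here is my phone number and my new address"))
    print(clf.classify("Are the opening hours available for the product"))
```

In a setting like the one described, the predicted label could then be attached to the message so that a usage control system can track it. How the authors' system actually passes labels to the usage control infrastructure is not stated in the abstract.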
Bier, C., & Prior, J. (2014). Detection and labeling of personal identifiable information in E-mails. In IFIP Advances in Information and Communication Technology (Vol. 428, pp. 351–358). Springer Science and Business Media, LLC. https://doi.org/10.1007/978-3-642-55415-5_29