PHIs (Protected Health Information) identification from free text clinical records based on machine learning

1Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.
Get full text

Abstract

To preserve patient confidentiality, there is a need to identify PHIs (Protected Health Information) from free text text clinical records, and such sensitive information must either be removed or replaced. Identification of the PHI's are normally performed manually on large sets of structured EHR databases, which is time-consuming, prohibitively expensive and error-prone. Hence, methods for automatic or semi-automatic identification of personal health information are of significant scientific and commercial interest. In this paper, we propose an innovative computational framework based on novel text mining and machine learning algorithms for automatic identification of PHIs from massive, unstructured free text clinical records, discharge summaries and other care documents. The experimental evaluation of the proposed algorithmic framework development, for several publicly available i2b2 challenge datasets from Informatics for Integrating Biology & the Bedside (i2b2) shared tasks, has shown promising outcomes.

Cite

CITATION STYLE

APA

Rajput, K., Chetty, G., & Davey, R. (2018). PHIs (Protected Health Information) identification from free text clinical records based on machine learning. In 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings (Vol. 2018-January, pp. 1–9). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/SSCI.2017.8285286

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free