Abstract
Patients and their families often require a better understanding of medical information provided by doctors. We currently address this issue by improving the identification of difficult to understand medical words. We introduce novel embeddings received from RNN - FrnnMUTE (French RNN Medical Understandability Text Embeddings) which allow to reach up to 87.0 F1 score in identification of difficult words. We also note that adding pre-trained FastText word embeddings to the feature set substantially improves the performance of the model which classifies words according to their difficulty. We study the gen-eralizability of different models through three cross-validation scenarios which allow testing classifiers in real-world conditions: understanding of medical words by new users, and classification of new unseen words by the automatic models. The RNN - FrnnMUTE embeddings and the categorization code are being made available for the research.
Cite
CITATION STYLE
Pylieva, H., Chernodub, A., Grabar, N., & Hamon, T. (2019). RNN embeddings for identifying difficult to understand medical words. In BioNLP 2019 - SIGBioMed Workshop on Biomedical Natural Language Processing, Proceedings of the 18th BioNLP Workshop and Shared Task (pp. 97–104). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-5011
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.