Augmenting embedding with domain knowledge for oral disease diagnosis prediction

Abstract

In this paper, we propose adding domain knowledge from SNOMED CT, the most comprehensive biomedical ontology, to facilitate the embedding of EMR symptoms and diagnoses for oral disease prediction. First, we learn embeddings of SNOMED CT concepts by applying the TransE algorithm, which is widely used for representation learning on knowledge bases. Second, we mathematically model the mapping from symptoms/diagnoses to biomedical concepts and the corresponding semantic relations defined in SNOMED CT. We then design a neural network that trains the embeddings of EMR symptoms and diagnoses together with those of ontological concepts in a coherent way, using the TransE-learned vectors as initial values for the latter. Evaluation on real-world EMR datasets from Peking University School and Hospital of Stomatology demonstrates improved prediction performance over embeddings based solely on EMRs. This study is a first attempt to learn distributed representations of EMR symptoms and diagnoses under the constraint of embeddings of biomedical concepts from a comprehensive clinical ontology. Incorporating domain knowledge can augment embedding because it reveals intrinsic correlations among symptoms and diagnoses that cannot be discovered from EMR data alone.
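
For readers unfamiliar with TransE, the general scoring idea can be sketched as follows. This is a minimal illustration of the standard formulation (a triple is plausible when head + relation lies close to tail), not the authors' implementation; the concept names, the 2-dimensional vectors, and the margin value are hypothetical and chosen only to make the arithmetic easy to follow.

```python
import math

def transe_score(head, relation, tail):
    """L2 distance ||head + relation - tail||; lower means a more plausible triple."""
    return math.sqrt(sum((h + r - t) ** 2 for h, r, t in zip(head, relation, tail)))

def margin_loss(pos_score, neg_score, margin=1.0):
    """Margin ranking loss: zero once the corrupted triple is worse by at least `margin`."""
    return max(0.0, margin + pos_score - neg_score)

# Hypothetical 2-d embeddings (illustrative only, not taken from the paper).
toothache = [1.0, 0.0]      # head concept
finding_site = [0.0, 1.0]   # relation
tooth = [1.0, 1.0]          # correct tail
femur = [4.0, 5.0]          # corrupted tail used as a negative sample

pos = transe_score(toothache, finding_site, tooth)
neg = transe_score(toothache, finding_site, femur)
loss = margin_loss(pos, neg)
```

In training, the loss would be minimized over many true and corrupted triples; the resulting concept vectors are the kind of values the abstract describes as initial values for the joint network.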

Cite

Li, G., Zhang, S., Liang, J., Cao, Z., & Guo, C. (2018). Augmenting embedding with domain knowledge for oral disease diagnosis prediction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11344 LNCS, pp. 236–250). Springer Verlag. https://doi.org/10.1007/978-3-030-05755-8_24
