Medical concept normalization in user-generated texts by learning target concept embeddings

2Citations
Citations of this article
74Readers
Mendeley users who have this article in their library.

Abstract

Medical concept normalization helps in discovering standard concepts in free-form text i.e., maps health-related mentions to standard concepts in a clinical knowledge base. It is much beyond simple string matching and requires a deep semantic understanding of concept mentions. Recent research approach concept normalization as either text classification or text similarity. The main drawback in existing a) text classification approach is ignoring valuable target concepts information in learning input concept mention representation b) text similarity approach is the need to separately generate target concept embeddings which is time and resource consuming. Our proposed model overcomes these drawbacks by jointly learning the representations of input concept mention and target concepts. First, we learn input concept mention representation using RoBERTa. Second, we find cosine similarity between embeddings of input concept mention and all the target concepts. Here, embeddings of target concepts are randomly initialized and then updated during training. Finally, the target concept with maximum cosine similarity is assigned to the input concept mention. Our model surpasses all the existing methods across three standard datasets by improving accuracy up to 2.31%.

References Powered by Scopus

Label-Embedding for Image Classification

674Citations
421Readers
Get full text

This article is free to access.

Joint embedding of words and labels for text classification

274Citations
533Readers

Cited by Powered by Scopus

This article is free to access.

NSSC: a neuro-symbolic AI system for enhancing accuracy of named entity recognition and linking from oncologic clinical notes

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Kalyan, K. S., & Sangeetha, S. (2020). Medical concept normalization in user-generated texts by learning target concept embeddings. In EMNLP 2020 - 11th International Workshop on Health Text Mining and Information Analysis, LOUHI 2020, Proceedings of the Workshop (pp. 18–23). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.louhi-1.3

Readers over time

‘20‘21‘22‘23‘24‘2508162432

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 22

73%

Researcher 5

17%

Lecturer / Post doc 3

10%

Readers' Discipline

Tooltip

Computer Science 26

74%

Linguistics 6

17%

Business, Management and Accounting 2

6%

Neuroscience 1

3%

Save time finding and organizing research with Mendeley

Sign up for free
0