Biomedical domain-oriented word embeddings via small background texts for biomedical text mining tasks

Lishuang Li; Jia Wan; Degen Huang

Conference Proceedings

Biomedical domain-oriented word embeddings via small background texts for biomedical text mining tasks

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 10619 LNAI 554-564

DOI: 10.1007/978-3-319-73618-1_46

0Citations

9Readers

Get full text

Abstract

Most word embedding methods are proposed with general purpose which take a word as a basic unit and learn embeddings by words’ external contexts. However, in the field of biomedical text mining, there are many biomedical entities and syntactic chunks which can enrich the semantic meaning of word embeddings. Furthermore, large scale background texts for training word embeddings are not available in some scenarios. Therefore, we propose a novel biomedical domain-specific word embeddings model based on maximum-margin (BEMM) to train word embeddings using small set of background texts, which incorporates biomedical domain information. Experimental results show that our word embeddings overall outperform other general-purpose word embeddings on some biomedical text mining tasks.

Author supplied keywords

Cite

CITATION STYLE

APA

Li, L., Wan, J., & Huang, D. (2018). Biomedical domain-oriented word embeddings via small background texts for biomedical text mining tasks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10619 LNAI, pp. 554–564). Springer Verlag. https://doi.org/10.1007/978-3-319-73618-1_46

Biomedical domain-oriented word embeddings via small background texts for biomedical text mining tasks

Abstract

Author supplied keywords

Cite

Register to see more suggestions