Abstract
We present our work on aligning the Unified Medical Language System (UMLS) to Wikipedia, to facilitate manual alignment of the two resources. We propose a cross-lingual neural reranking model to match a UMLS concept with a Wikipedia page, which achieves a recall@1 of 72%, a substantial improvement of 20% over word- and char-level BM25, enabling manual alignment with minimal effort. We release our resources, including ranked Wikipedia pages for 700k UMLS concepts, and WikiUMLS, a dataset for training and evaluation of alignment models between UMLS and Wikipedia collected from Wikidata. This will provide easier access to Wikipedia for health professionals, patients, and NLP systems, including in multilingual settings.
Cite
CITATION STYLE
Rahimi, A., Baldwin, T., & Verspoor, K. (2020). WikiUMLS: Aligning UMLS to Wikipedia via Cross-lingual Neural Ranking. In COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference (pp. 5957–5962). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.coling-main.523
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.