Improving Neural Models for Radiology Report Retrieval with Lexicon-based Automated Annotation

Abstract

Many clinical informatics tasks based on electronic health records (EHRs) require selecting relevant patient cohorts according to findings, symptoms, and diseases. Frequently, these conditions are described in radiology reports, which can be retrieved using information retrieval (IR) methods. The latest of these techniques use neural IR models such as BERT trained on clinical text. However, these methods still lack semantic understanding of the underlying clinical conditions and of ruled-out findings, resulting in poor precision during retrieval. In this paper, we combine clinical finding detection with supervised query match learning. Specifically, we use lexicon-driven concept detection to detect relevant findings in sentences. These findings are used as queries to train a Sentence-BERT (SBERT) model using triplet loss on matched and unmatched query-sentence pairs. We show that the proposed supervised training task substantially improves the retrieval performance of SBERT. The trained model generalizes well to unseen queries and reports from different collections.
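The pipeline the abstract describes (lexicon-driven finding detection, findings reused as queries, triplet loss over matched and unmatched query-sentence pairs) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy `LEXICON`, the crude `NEGATION_CUES` rule for ruled-out findings, the helper names, and the use of Euclidean distance with a margin of 1.0. A real setup would embed the query and sentences with an SBERT encoder; plain lists of floats stand in for those embeddings.

```python
import math
import re

# Hypothetical toy lexicon (finding -> surface forms); not the paper's lexicon.
LEXICON = {
    "pneumothorax": ["pneumothorax"],
    "pleural effusion": ["pleural effusion", "effusion"],
}

# Hypothetical, deliberately crude negation cues for ruled-out findings.
NEGATION_CUES = re.compile(r"\b(no|without|negative for)\b")


def detect_findings(sentence):
    """Return the set of lexicon findings affirmed in a sentence.

    A mention is skipped when a negation cue appears anywhere before it
    in the same sentence (a simplifying assumption).
    """
    text = sentence.lower()
    found = set()
    for finding, forms in LEXICON.items():
        for form in forms:
            match = re.search(r"\b" + re.escape(form) + r"\b", text)
            if match is None:
                continue
            if NEGATION_CUES.search(text[:match.start()]) is None:
                found.add(finding)
    return found


def build_triplets(sentences):
    """Build (query, matched sentence, unmatched sentence) training triplets."""
    annotations = {s: detect_findings(s) for s in sentences}
    triplets = []
    for query in LEXICON:
        positives = [s for s in sentences if query in annotations[s]]
        negatives = [s for s in sentences if query not in annotations[s]]
        for pos in positives:
            for neg in negatives:
                triplets.append((query, pos, neg))
    return triplets


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Triplet loss: max(0, d(a, p) - d(a, n) + margin), Euclidean d."""
    return max(0.0, math.dist(anchor, positive) - math.dist(anchor, negative) + margin)


sentences = [
    "Large right pneumothorax.",
    "No pneumothorax.",
    "Small left pleural effusion.",
]
triplets = build_triplets(sentences)
```

In this sketch the sentence "No pneumothorax." is treated as unmatched for the query "pneumothorax", so it serves as a negative example against "Large right pneumothorax.". That is the kind of ruled-out finding the abstract identifies as a source of poor retrieval precision.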

Citation (APA)

Shi, L., Syeda-Mahmood, T. F., & Baldwin, T. (2022). Improving Neural Models for Radiology Report Retrieval with Lexicon-based Automated Annotation. In NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 3457–3463). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.naacl-main.253
