Extraction of temporal relations from clinical free text: A systematic review of current approaches

Ghada Alfattni; Niels Peek; Goran Nenadic

Article (open access)

Journal of Biomedical Informatics

DOI: 10.1016/j.jbi.2020.103488

Abstract

Background: Temporal relations between clinical events play an important role in clinical assessment and decision making. Extracting such relations from free-text data is challenging because the task lies at the intersection of medical natural language processing, temporal representation and temporal reasoning.

Objectives: To survey existing methods for extracting temporal relations (TLINKs) between events from clinical free text in English; to establish the state of the art in this field; and to identify outstanding methodological challenges.

Methods: A systematic search of PubMed and the DBLP computer science bibliography was conducted for studies published between January 2006 and December 2018. Relevant studies were identified by examining titles and abstracts. The full text of the selected studies was then analyzed in depth, and information was collected on TLINK tasks, TLINK types, data sources, feature selection, methods used, and reported performance.

Results: A total of 2834 publications were identified for title and abstract screening. Of these, 51 studies were selected. Thirty-two studies used machine learning approaches, 15 used hybrid approaches, and only four used a rule-based approach. The majority of studies used publicly available corpora: THYME (28 studies) and the i2b2 corpus (17 studies).

Conclusion: The performance of TLINK extraction methods ranges widely depending on relation types and events (e.g., from 32% to 87% F-score for identifying relations between clinical events and document creation time). A small set of TLINKs (before, after, overlap and contains) has been widely studied with relatively good performance, whereas other types of TLINK (e.g., started by, finished by, precedes) are rarely studied and remain challenging. Machine learning classifiers (such as Support Vector Machines and Conditional Random Fields) and deep neural networks were among the best-performing methods for extracting TLINKs, but nearly all of the work has been carried out and tested on only two publicly available corpora. The field would benefit from more publicly available, high-quality, annotated clinical text corpora.

Cite

Alfattni, G., Peek, N., & Nenadic, G. (2020, August 1). Extraction of temporal relations from clinical free text: A systematic review of current approaches. Journal of Biomedical Informatics. Academic Press Inc. https://doi.org/10.1016/j.jbi.2020.103488
