Iterative development of family history annotation guidelines using a synthetic corpus of clinical text

12Citations
Citations of this article
82Readers
Mendeley users who have this article in their library.

Abstract

In this article, we describe the development of annotation guidelines for family history information in Norwegian clinical text. We make use of incrementally developed synthetic clinical text describing patients' family history relating to cases of cardiac disease and present a general methodology which integrates the synthetically produced clinical statements and guideline development. We analyze inter-annotator agreement based on the developed guidelines and present results from experiments aimed at evaluating the validity and applicability of the annotated corpus using machine learning techniques. The resulting annotated corpus contains 477 sentences and 6030 tokens. Both the annotation guidelines and the annotated corpus are made freely available and as such constitutes the first publicly available resource of Norwegian clinical text.

Cite

CITATION STYLE

APA

Rama, T., Brekke, P. H., Nytrø, Ø., & Øvrelid, L. (2018). Iterative development of family history annotation guidelines using a synthetic corpus of clinical text. In EMNLP 2018 - 9th International Workshop on Health Text Mining and Information Analysis, LOUHI 2018 - Proceedings of the Workshop (pp. 111–121). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w18-5613

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free