Abstract
Summarization of clinical narratives is a longstanding research problem. Here, we introduce the task of hospital-course summarization. Given the documentation authored throughout a patient’s hospitalization, generate a paragraph that tells the story of the patient admission. We construct an English, text-to-text dataset of 109,000 hospitalizations (2M source notes) and their corresponding summary proxy: the clinician-authored “Brief Hospital Course” paragraph written as part of a discharge note. Exploratory analyses reveal that the BHC paragraphs are highly abstractive with some long extracted fragments; are concise yet comprehensive; differ in style and content organization from the source notes; exhibit minimal lexical cohesion; and represent silver-standard references. Our analysis identifies multiple implications for modeling this complex, multi-document summarization task.
Cite
CITATION STYLE
Adams, G., Alsentzer, E., Ketenci, M., Zucker, J., & Elhadad, N. (2021). What’s in a Summary? Laying the Groundwork for Advances in Hospital-Course Summarization. In NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference (pp. 4794–4811). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.naacl-main.382
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.