Resolving Elliptical Compounds in German Medical Text

4Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Elliptical coordinated compound noun phrases (ECCNPs), a special kind of coordination ellipsis, are a common phenomenon in German medical texts. As their presence is known to affect the performance in downstream tasks such as entity extraction and disambiguation, their resolution can be a useful preprocessing step in information extraction pipelines. In this work, we present a new comprehensive dataset of more than 4,000 manually annotated ECCNPs in German medical text, along with the respective ground truth resolutions. Based on this data, we propose a generative encoder–decoder Transformer model, allowing for a simple end-to-end resolution of ECCNPs from raw input strings with very high accuracy (90.5 % exact match score). We compare our approach to an elaborate rule-based baseline, which the generative model outperforms by a large margin. We further investigate different scenarios for prompting large language models (LLM) to resolve ECCNPs. In a zero-shot setting, performance is remarkably poor (21.6 % exact matches), as the LLM tends to apply complex changes to the inputs unrelated to our specific task. We also find no improvement over the generative model when using the LLM for post-filtering of generated candidate resolutions. The source code including instructions on how to access the data are available at: https://github.com/hpi-dhc/ggponc_ ellipses.

Cite

CITATION STYLE

APA

Kämmer, N., Borchert, F., Winkler, S., de Melo, G., & Schapranow, M. P. (2023). Resolving Elliptical Compounds in German Medical Text. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 292–305). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.bionlp-1.26

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free