A Self-training Framework for Automated Medical Report Generation

Siyuan Wang; Zheng Liu; Bo Peng

Conference ProceedingsOPEN ACCESS

A Self-training Framework for Automated Medical Report Generation

EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (2023) 16443-16449

DOI: 10.18653/v1/2023.emnlp-main.1024

6Citations

9Readers

Abstract

Medical report generation, focusing on automatically generating accurate clinical findings from medical images, is an important medical artificial intelligence task. It reduces the workload of physicians in writing reports. Many of the current methods depend heavily on labeled datasets that include image-report pairs, but such datasets labeled by physicians are hard to acquire in clinical practice. In this paper, we introduce a self-training framework named REMOTE (i.e., Revisiting sElf-training for Medical repOrT gEneration) to exploit the unlabeled medical images and a MedCLIPScore to augment a small-scale dataset for training the medical report generation model. Experiments conducted on the MIMIC-CXR benchmark dataset and a COVID-19 dataset demonstrate that, our REMOTE framework, using only 1% labeled training data, achieves competitive performance with previous methods that are trained on entire training data.

Cite

CITATION STYLE

APA

Wang, S., Liu, Z., & Peng, B. (2023). A Self-training Framework for Automated Medical Report Generation. In EMNLP 2023 - 2023 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 16443–16449). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.emnlp-main.1024

A Self-training Framework for Automated Medical Report Generation

Abstract

Cite

Register to see more suggestions