TYPE-AWARE MEDICAL VISUAL QUESTION ANSWERING

Anda Zhang; Wei Tao; Ziyan Li; Haofen Wang; Wenqiang Zhang

Conference Proceedings

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (2022), Vol. 2022-May, pp. 4838-4842

DOI: 10.1109/ICASSP43922.2022.9747087

Abstract

Medical Visual Question Answering (Med-VQA) automatically answers medical questions raised by patients, helping to relieve the shortage of experienced doctors. Cross-modal feature alignment is a major challenge in Med-VQA. Moreover, it is critical to extract sufficient semantic features that account for the characteristics of medical images and language. In this paper, we propose a novel From Image type point To Sentence (FITS) method to tackle these challenges. In particular, the type of a medical image is represented as a type point, which is then incorporated into the question sentence representation. The combined representation aims to optimize the feature distribution in the embedding space and thus enhance semantic alignment. The type point is also used in two feature extraction modules, one for medical questions and one for images, which efficiently improves the reasoning ability of each modality and further enhances the applicability of the fusion method to Med-VQA. Experimental results show that FITS outperforms all previous approaches in accuracy, and significantly so on open-ended questions.
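
The abstract does not say how the type point is combined with the question representation. As a rough illustration only, the sketch below assumes one vector per image type that is added to a question embedding and then L2-normalized. The type names, the dimension, the random vectors, and the `combine` helper are all hypothetical and are not taken from the paper.

```python
import math
import random

random.seed(0)

# Hypothetical image types and embedding size, chosen only for illustration.
IMAGE_TYPES = ["ct", "mri", "xray"]
DIM = 8

# One vector ("type point") per image type. In a trained model these would be
# learned; here they are random placeholders.
type_points = {t: [random.gauss(0, 1) for _ in range(DIM)] for t in IMAGE_TYPES}


def combine(question_vec, image_type):
    """Add the type point to a question embedding, then L2-normalize.

    Addition is an assumed combination rule, not the paper's stated method.
    """
    point = type_points[image_type]
    combined = [q + p for q, p in zip(question_vec, point)]
    norm = math.sqrt(sum(c * c for c in combined))
    return [c / norm for c in combined]


# Stand-in for the output of a sentence encoder.
question_vec = [random.gauss(0, 1) for _ in range(DIM)]

q_ct = combine(question_vec, "ct")
q_mri = combine(question_vec, "mri")
```

Under this assumption, the same question maps to a different point in the embedding space depending on the image type, which is one simple way a type signal could shape the feature distribution.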

Cite

Zhang, A., Tao, W., Li, Z., Wang, H., & Zhang, W. (2022). TYPE-AWARE MEDICAL VISUAL QUESTION ANSWERING. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings (Vol. 2022-May, pp. 4838–4842). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICASSP43922.2022.9747087
