Background: Artificial intelligence models can learn from medical literature and clinical cases and generate answers that rival human experts. However, challenges remain in the analysis of complex data containing images and diagrams.

Objective: This study aims to assess the answering capabilities and accuracy of ChatGPT-4 Vision (GPT-4V) on a set of 100 questions, including image-based questions, from the 2023 otolaryngology board certification examination.

Methods: Answers to 100 questions from the 2023 otolaryngology board certification examination, including image-based questions, were generated using GPT-4V. The accuracy rate was evaluated under different prompts, and the presence of images, the clinical area of the questions, and variations in the answer content were examined.

Results: The accuracy rate for text-only input was, on average, 24.7% but improved to 47.3% with the addition of English translation and prompts (P
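The methods describe submitting each exam question to GPT-4V, with the image attached where present, under different prompt conditions including English translation. Below is a minimal sketch of how such an image-plus-text query could be issued through the OpenAI Chat Completions API; the model name, prompt wording, helper function, and file name are illustrative assumptions, not the study's actual prompts or data.

```python
# Sketch only: assumes the OpenAI Python SDK (openai>=1.0) and a hypothetical
# local figure file "question_12.png". Prompt text is illustrative.
import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def ask_image_question(question_text: str, image_path: str) -> str:
    """Send a translated exam question plus its figure to GPT-4V."""
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("utf-8")

    response = client.chat.completions.create(
        model="gpt-4-vision-preview",  # GPT-4V model name used in 2023
        messages=[
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": question_text},
                    {
                        "type": "image_url",
                        "image_url": {"url": f"data:image/png;base64,{image_b64}"},
                    },
                ],
            }
        ],
        max_tokens=300,
    )
    return response.choices[0].message.content


# Example: an English-translated question with a role-style prompt prepended.
answer = ask_image_question(
    "You are taking an otolaryngology board certification examination. "
    "Choose one option (a-e) and briefly justify your choice.\n\n"
    "Question: ... (translated question text and answer options) ...",
    "question_12.png",
)
print(answer)
```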
Noda, M., Ueno, T., Koshu, R., Takaso, Y., Shimada, M. D., Saito, C., … Yoshizaki, T. (2024). Performance of GPT-4V in Answering the Japanese Otolaryngology Board Certification Examination Questions: Evaluation Study. JMIR Medical Education, 10. https://doi.org/10.2196/57054