Performance of Large Language Models on Medical Oncology Examination Questions

Jack B. Longwell; Ian Hirsch; Fernando Binder; Galileo Arturo Gonzalez Conchas; Daniel Mau; Raymond Jang; Rahul G. Krishnan; Robert C. Grant

Journal ArticleOPEN ACCESS

Performance of Large Language Models on Medical Oncology Examination Questions

JAMA Network Open (2024) 7(6) e2417641

DOI: 10.1001/jamanetworkopen.2024.17641

45Citations

63Readers

Abstract

Importance: Large language models (LLMs) recently developed an unprecedented ability to answer questions. Studies of LLMs from other fields may not generalize to medical oncology, a high-stakes clinical setting requiring rapid integration of new information. Objective: To evaluate the accuracy and safety of LLM answers on medical oncology examination questions. Design, Setting, and Participants: This cross-sectional study was conducted between May 28 and October 11, 2023. The American Society of Clinical Oncology (ASCO) Oncology Self-Assessment Series on ASCO Connection, the European Society of Medical Oncology (ESMO) Examination Trial questions, and an original set of board-style medical oncology multiple-choice questions were presented to 8 LLMs. Main Outcomes and Measures: The primary outcome was the percentage of correct answers. Medical oncologists evaluated the explanations provided by the best LLM for accuracy, classified the types of errors, and estimated the likelihood and extent of potential clinical harm. Results: Proprietary LLM 2 correctly answered 125 of 147 questions (85.0%; 95% CI, 78.2%-90.4%; P

Cite

CITATION STYLE

APA

Longwell, J. B., Hirsch, I., Binder, F., Gonzalez Conchas, G. A., Mau, D., Jang, R., … Grant, R. C. (2024). Performance of Large Language Models on Medical Oncology Examination Questions. JAMA Network Open, 7(6), e2417641. https://doi.org/10.1001/jamanetworkopen.2024.17641

Performance of Large Language Models on Medical Oncology Examination Questions

Abstract

Cite

Register to see more suggestions