“Application and accuracy of artificial intelligence-derived large language models in patients with age related macular degeneration”

Lorenzo Ferro Desideri; Janice Roth; Martin Zinkernagel; Rodrigo Anguita

Journal ArticleOPEN ACCESS

“Application and accuracy of artificial intelligence-derived large language models in patients with age related macular degeneration”

International Journal of Retina and Vitreous (2023) 9(1)

DOI: 10.1186/s40942-023-00511-7

3Citations

14Readers

Abstract

Introduction: Age-related macular degeneration (AMD) affects millions of people globally, leading to a surge in online research of putative diagnoses, causing potential misinformation and anxiety in patients and their parents. This study explores the efficacy of artificial intelligence-derived large language models (LLMs) like in addressing AMD patients' questions. Methods: ChatGPT 3.5 (2023), Bing AI (2023), and Google Bard (2023) were adopted as LLMs. Patients’ questions were subdivided in two question categories, (a) general medical advice and (b) pre- and post-intravitreal injection advice and classified as (1) accurate and sufficient (2) partially accurate but sufficient and (3) inaccurate and not sufficient. Non-parametric test has been done to compare the means between the 3 LLMs scores and also an analysis of variance and reliability tests were performed among the 3 groups. Results: In category a) of questions, the average score was 1.20 (± 0.41) with ChatGPT 3.5, 1.60 (± 0.63) with Bing AI and 1.60 (± 0.73) with Google Bard, showing no significant differences among the 3 groups (p = 0.129). The average score in category b was 1.07 (± 0.27) with ChatGPT 3.5, 1.69 (± 0.63) with Bing AI and 1.38 (± 0.63) with Google Bard, showing a significant difference among the 3 groups (p = 0.0042). Reliability statistics showed Chronbach’s α of 0.237 (range 0.448, 0.096–0.544). Conclusion: ChatGPT 3.5 consistently offered the most accurate and satisfactory responses, particularly with technical queries. While LLMs displayed promise in providing precise information about AMD; however, further improvements are needed especially in more technical questions.

Author supplied keywords

Cite

CITATION STYLE

APA

Ferro Desideri, L., Roth, J., Zinkernagel, M., & Anguita, R. (2023). “Application and accuracy of artificial intelligence-derived large language models in patients with age related macular degeneration.” International Journal of Retina and Vitreous, 9(1). https://doi.org/10.1186/s40942-023-00511-7

“Application and accuracy of artificial intelligence-derived large language models in patients with age related macular degeneration”

Abstract

Author supplied keywords

Cite

Register to see more suggestions