Physician Use of Large Language Models: A Quantitative Study Based on Large-Scale Query-Level Data

Lin Qiu; Chuang Tang; Xuan Bi; Gordon Burtch; Yanmin Chen; Heping Zhang

Journal Article

Physician Use of Large Language Models: A Quantitative Study Based on Large-Scale Query-Level Data

Journal of Medical Internet Research (2025) 27

DOI: 10.2196/76941

4Citations

42Readers

Get full text

Abstract

Background: Generative artificial intelligence (GenAI) has rapidly emerged as a promising tool in health care. Despite its growing adoption, how physicians make use of it in medical practice has not been qualitatively studied. Existing literature has largely focused on theoretical applications or experimental validations, with limited insight into real-world physician engagement with GenAI technologies. Objective: The aim of this study was to leverage a fine-grained dataset at the query level to quantitatively examine how physicians incorporate GenAI into their clinical and research workflows. The primary objective was to analyze usage patterns over time and across physician demographics. A secondary goal was to assess potential risks to patient privacy arising from physicians’ interactions with GenAI platforms. Methods: This study collected 106,942 query-and-answer pairs by 989 physicians between August 29, 2023, and April 16, 2024. We performed topic classification to identify the most prevalent use cases, examining how these use cases evolved over time and across demographics. We also developed sensitivity classifiers to detect personally identifiable information in physicians’ queries to explore the potential privacy breach risks around physicians’ use of GenAI. Results: Approximately 40% (396/989) of the enrolled physicians were female, 45.9% (454/989) were younger than 25 years, and 54.1% (535/989) were between 25 and 56 years of age. The majority of them worked in clinical departments (680/989, 68.8%) or medical technology departments (127/989, 12.8%). Our classification-based quantitative analyses suggest the following. First, physicians use GenAI predominantly for medical research (64,379/106,942, 60.2%) rather than clinical practice (13,100/106,942, 12.25%). Second, physicians focus more on health care–related questions (rising from 64,165/106,942, 60% to 83,415/106,942, 78%) within the first 15% (16,041/106,942) of their query sequence. Third, the use of GenAI differed across physician demographics and features. Specifically, female physicians asked a larger proportion of clinical questions (female: 0.154 vs male: 0.108; P 40: 0.103; P 40: 0.664; P

Author supplied keywords

Cite

CITATION STYLE

APA

Qiu, L., Tang, C., Bi, X., Burtch, G., Chen, Y., & Zhang, H. (2025). Physician Use of Large Language Models: A Quantitative Study Based on Large-Scale Query-Level Data. Journal of Medical Internet Research, 27. https://doi.org/10.2196/76941

Physician Use of Large Language Models: A Quantitative Study Based on Large-Scale Query-Level Data

Abstract

Author supplied keywords

Cite

Register to see more suggestions