Abstract
Background The journey of >80% of patients diagnosed with lung cancer starts in general practice. About 75% of patients are diagnosed when it is at an advanced stage (3 or 4), leading to >80% mortality within 1 year at present. The long-term data in GP records might contain hidden information that could be used for earlier case finding of patients with cancer. Aim To develop new prediction tools that improve the risk assessment for lung cancer. Design and setting Text analysis of electronic patient data using natural language processing and machine learning in the general practice files of four networks in the Netherlands. Method Files of 525 526 patients were analysed, of whom 2386 were diagnosed with lung cancer. Diagnoses were validated by using the Dutch cancer registry, and both structured and free-text data were used to predict the diagnosis of lung cancer 5 months before diagnosis (4 months before referral). Results The algorithm could facilitate earlier detection of lung cancer using routine general practice data. Discrimination, calibration, sensitivity, and specificity were established under various cut-off points of the prediction 5 months before diagnosis. Internal validation of the best model demonstrated an area under the curve of 0.88 (95% confidence interval [CI] = 0.86 to 0.89), which shrunk to 0.79 (95% CI = 0.78 to 0.80) during external validation. The desired sensitivity determines the number of patients to be referred to detect one patient with lung cancer. Conclusion Artificial intelligence-based support enables earlier detection of lung cancer in general practice using readily available text in the patient files of GPs, but needs additional prospective clinical evaluation.
Author supplied keywords
Cite
CITATION STYLE
Schut, M. C., Luik, T. T., Vagliano, I., Rios, M., Helsper, C. W., van Asselt, K. M., … van Weert, H. C. P. M. (2025). Artificial intelligence for early detection of lung cancer in GPs’ clinical notes: a retrospective observational cohort study. British Journal of General Practice, 75(754), e316–e322. https://doi.org/10.3399/BJGP.2023.0489
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.