Abstract
Background: Deep learning breast cancer risk models demonstrate improved accuracy compared with traditional risk models but have not been prospectively tested. We compared the accuracy of a deep learning risk score derived from the patient's prior mammogram to traditional risk scores to prospectively identify patients with cancer in a cohort due for screening. Methods: We collected data on 119 139 bilateral screening mammograms in 57 617 consecutive patients screened at 5 facilities between September 18, 2017, and February 1, 2021. Patient demographics were retrieved from electronic medical records, cancer outcomes determined through regional tumor registry linkage, and comparisons made across risk models using Wilcoxon and Pearson χ2 2-sided tests. Deep learning, Tyrer-Cuzick, and National Cancer Institute Breast Cancer Risk Assessment Tool (NCI BCRAT) risk models were compared with respect to performance metrics and area under the receiver operating characteristic curves. Results: Cancers detected per thousand patients screened were higher in patients at increased risk by the deep learning model (8.6, 95% confidence interval [CI] = 7.9 to 9.4) compared with Tyrer-Cuzick (4.4, 95% CI = 3.9 to 4.9) and NCI BCRAT (3.8, 95% CI = 3.3 to 4.3) models (P
Cite
CITATION STYLE
Lehman, C. D., Mercaldo, S., Lamb, L. R., King, T. A., Ellisen, L. W., Specht, M., & Tamimi, R. M. (2022). Deep Learning vs Traditional Breast Cancer Risk Models to Support Risk-Based Mammography Screening. Journal of the National Cancer Institute, 114(10), 1355–1363. https://doi.org/10.1093/jnci/djac142
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.