Prediction of 1-Year Activity in Systemic Lupus Erythematosus: Hierarchical Machine Learning Approach

1Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Background: Systemic lupus erythematosus (SLE) is a chronic disease characterized by a broad spectrum of involved organs, including neurological, renal, and vascular domains, with disease activity manifesting through unpredictable patterns that vary across individuals and over time, making the prediction of activity events particularly challenging. Objective: This paper proposes a hierarchical machine learning model to predict a 12-month SLE activity, defined as the occurrence of at least one event among SLE hospitalization, new organ-involved domain, and neurological, renal, or vascular manifestation within the following year. At each patient’s visit, the model considers all the features at the current time point, the information about the patient’s clinical history, and about its last 12 months, to predict the outcome for the next 12 months. Methods: The study cohort consists of 262 patients with at least an outpatient visit and an SLE admission from 2012 to 2020, at the Italian Gemelli Hospital, comprising a retrospective longitudinal dataset of 5962 contacts. The data include demographics, laboratory, clinical features (eg, domain involvements and manifestations), treatments, and pathways (eg, contact types as outpatients, hospitalizations, day hospitals, and visit frequency). The variables consider 3 time ranges: features about the current contact and the last 12 months, and the previous patient’s clinical history. The main model was developed by testing different machine learning approaches within a cross-validation setup. The predicted probability outputs were used in a risk stratification analysis, identifying 3 groups of predictions: strong, moderate, and mild. Mild samples were then passed through a second cascade model. The integration of the main model (applied to strong and moderate samples) with the cascade model (applied to mild contacts) forms our final hierarchical model. Results: The hierarchical model, resulting from the ensemble of the main random forest and cascade decision tree, demonstrated enhanced performance, increasing the area under the receiver operating characteristic curve from 0.696 (95% CI 0.672‐0.719) in the original main model to 0.743 (95% CI 0.717-0.769), particularly for specific patient characteristics. Through the application of explainable artificial intelligence methods, we also identified the key features that significantly influence the model’s predictions. Among the 185 collected features, 15 emerged as the most impactful, including age at contact, response to therapy modifications, abnormal laboratory tests, and clinical manifestations. This analysis plays a crucial role in enhancing model transparency, which is essential for fostering the adoption of artificial intelligence in health care settings. Conclusions: Our study introduces an explainable and reliable tool for predicting 1-year SLE activity, supporting physicians with an advanced decision-support system to improve patient management. The model identifies key features that may help characterize patient phenotypes, enabling personalized treatment plans and better outcomes. In addition, the methodology can be generalized for predictive analytics in other chronic autoimmune diseases.

Cite

CITATION STYLE

APA

Lilli, L., Antenucci, L., Ortolan, A., Bosello, S. L., Patarnello, S., Masciocchi, C., … Lenkowicz, J. (2025). Prediction of 1-Year Activity in Systemic Lupus Erythematosus: Hierarchical Machine Learning Approach. JMIR Formative Research, 9. https://doi.org/10.2196/70200

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free