Abstract
A stroke, also known as a brain attack, is a serious medical condition that occurs when the blood supply to the brain is disrupted. It is a leading cause of death globally, accounting for about 11% of all deaths. In this paper, the author opts to use logistic regression for predicting the stroke. The paper starts with introducing the methods used to preprocess the raw dataset, including data cleaning, label encoding, oversampling, splitting the dataset and finally feature scaling. Then it goes to the modeling section. In this section, the author introduces the process of constructing a logistic regression model in detail. The first step is building the sigmoid function, a foundation of the model. Then the cost function is built to measure the difference between the model's predicted values and the true values. After that, the author constructs the gradient-computing function to determine the rate and direction at each iteration when implementing gradient descent. Based on this function, the gradient descent function is finally built, and predictions are made according to the outcome after putting testing variables into the model. By analyzing the results, the author also compares the different performances of the model between the one without regularization and the one equipped with regularization and then draw a conclusion that using regularization can help improve the performance of the model. At the end of the study, the author gets a rather satisfying result with a prediction accuracy of over 95% in the collected dataset.
Cite
CITATION STYLE
Wang, L. (2023). Logistic Regression for Stroke Prediction: An Evaluation of its Accuracy and Validity. Highlights in Science, Engineering and Technology, 39, 1086–1092. https://doi.org/10.54097/hset.v39i.6712
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.