In this paper we propose a novel combined approach to solve the imbalanced data issue in the application to the problem of the post-operative life expectancy prediction for the lung cancer patients. This solution makes use of undersampling techniques together with cost-sensitive SVM (Support Vector Machines). First, we eliminate non-informative examples by applying Tomek links together with onesided selection. Second, we take advantage of using cost-sensitive SVM with penalty costs calculated respecting cardinalities of minority and majority examples. We evaluate the presented solution by comparing the performance of our method with SVM-based approaches that deal with uneven data. The experimental evaluation was performed on reallife data from the postoperative risk management domain.
CITATION STYLE
Zięba, M., Świątek, J., & Lubicz, M. (2014). Cost sensitive SVM with non-informative examples elimination for imbalanced postoperative risk management problem. In Advances in Intelligent Systems and Computing (Vol. 240, pp. 305–314). Springer Verlag. https://doi.org/10.1007/978-3-319-01857-7_29
Mendeley helps you to discover research relevant for your work.