Handling imbalanced data in churn prediction using RUSBoost and feature selection (Case study: PT. Telekomunikasi Indonesia regional 7)

Erna Dwiyanti; Adiwijaya; Arie Ardiyanti

Conference Proceedings

Advances in Intelligent Systems and Computing (2017) 549 AISC 376-385

DOI: 10.1007/978-3-319-51281-5_38

Abstract

Solving imbalance problems is a challenging task in data mining and machine learning. Most classifiers are biased towards majority-class examples when learning from highly imbalanced data. In practice, churn prediction is a data mining application in which the imbalance problem arises. This study investigates how to handle class imbalance in churn prediction using RUSBoost, an algorithm that combines random under-sampling with boosting, together with feature selection for better performance. The datasets used are broadband internet data collected from a telecommunications company in Indonesia. The study first selects the important features using Information Gain and then builds a churn prediction model using RUSBoost with C4.5 as the weak learner. The results show that feature selection and RUSBoost improve prediction performance by 16% and reduce processing time by 48%.
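
The two preprocessing ideas named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the toy rows, the feature meanings, and the helper names (`entropy`, `information_gain`, `random_undersample`) are hypothetical, features are assumed to be categorical (continuous features would need discretization first), and the 1:1 class ratio after under-sampling is an assumption. The boosting loop and the C4.5 weak learner are omitted; the sketch only shows Information Gain ranking and the random under-sampling step that RUSBoost applies to the training data in each boosting round.

```python
import math
import random
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(values, labels):
    """Entropy reduction obtained by splitting `labels` on a categorical feature."""
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def random_undersample(rows, labels, minority_label, rng):
    """Keep all minority rows plus an equally sized random subset of majority rows.

    The 1:1 target ratio is an assumption made for this sketch.
    """
    minority = [i for i, y in enumerate(labels) if y == minority_label]
    majority = [i for i, y in enumerate(labels) if y != minority_label]
    kept = minority + rng.sample(majority, len(minority))
    return [rows[i] for i in kept], [labels[i] for i in kept]


# Hypothetical toy data: (contract_type, payment_method) with churn flag 1 = churner.
rows = [("monthly", "cash"), ("monthly", "card"), ("yearly", "cash"), ("yearly", "card"),
        ("yearly", "cash"), ("yearly", "card"), ("yearly", "cash"), ("monthly", "card")]
labels = [1, 1, 0, 0, 0, 0, 0, 0]

# Rank features by Information Gain (higher means more informative about churn).
gains = [information_gain([r[j] for r in rows], labels) for j in range(2)]

# Balance the classes before a weak learner would be trained in one boosting round.
balanced_rows, balanced_labels = random_undersample(rows, labels, 1, random.Random(0))
```

On this toy data the first feature receives a higher Information Gain than the second, so a selection step would keep it first, and the under-sampled set contains the two churners plus two randomly chosen non-churners.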

Cite

Dwiyanti, E., Adiwijaya, & Ardiyanti, A. (2017). Handling imbalanced data in churn prediction using RUSBoost and feature selection (Case study: PT. Telekomunikasi Indonesia regional 7). In Advances in Intelligent Systems and Computing (Vol. 549 AISC, pp. 376–385). Springer Verlag. https://doi.org/10.1007/978-3-319-51281-5_38
