Comparative study of dimension reduction methods for highly imbalanced overlapping churn data

4Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Retention of possible churning customer is one of the most important issues in customer relationship management, so companies try to predict churn customers using their large-scale high-dimensional data. This study focuses on dealing with large data sets by reducing the dimensionality. By using six different dimension reduction methods-Principal Component Analysis (PCA), factor analysis (FA), locally linear embedding (LLE), local tangent space alignment (LTSA), locally preserving projections (LPP), and deep auto-encoder-our experiments apply each dimension reduction method to the training data, build a classification model using the mapped data and then measure the performance using hit rate to compare the dimension reduction methods. In the result, PCA shows good performance despite its simplicity, and the deep auto-encoder gives the best overall performance. These results can be explained by the characteristics of the churn prediction data that is highly correlated and overlapped over the classes. We also proposed a simple out-of-sample extension method for the nonlinear dimension reduction methods, LLE and LTSA, utilizing the characteristic of the data.

Cite

CITATION STYLE

APA

Lee, S., Koo, B., & Jung, K. H. (2014). Comparative study of dimension reduction methods for highly imbalanced overlapping churn data. Industrial Engineering and Management Systems, 13(4), 454–462. https://doi.org/10.7232/iems.2014.13.4.454

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free