A semi-supervised segmentation algorithm as applied to k-means using information value

  • Breed D
  • Verster T
  • Terblanche S
N/ACitations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

A semi-supervised segmentation algorithm as applied to k-means using information value DG Breed * T Verster † SE Terblanche ‡ Abstract Segmentation (or partitioning) of data for the purpose of enhancing predictive modelling is a well-established practice in the banking industry. Unsupervised and supervised approaches are the two main streams of segmentation and examples exist where the application of these techniques improved the performance of predictive models. Both these streams focus, however , on a single aspect (i.e. either target separation or independent variable distribution) and combining them may deliver better results in some instances. In this paper a semi-supervised segmentation algorithm is presented, which is based on k-means clustering and which applies information value for the purpose of informing the segmentation process. Simulated data are used to identify a few key characteristics that may cause one segmentation technique to out-perform another. In the empirical study the newly proposed semi-supervised segmentation algorithm outperforms both an unsupervised and a supervised segmentation technique, when compared by using the Gini coefficient as performance measure of the resulting predictive models.

Cite

CITATION STYLE

APA

Breed, D., Verster, T., & Terblanche, S. (2017). A semi-supervised segmentation algorithm as applied to k-means using information value. ORiON, 33(2), 85. https://doi.org/10.5784/33-2-568

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free