Constrained Classification of Large Imbalanced Data by Logistic Regression and Genetic Algorithm

  • Hlosta M
  • Stríž R
  • Kupčík J
  • et al.
N/ACitations
Citations of this article
15Readers
Mendeley users who have this article in their library.

Abstract

Imbalance in data classification is a frequently discussed problem that is not well handled by classical classification techniques. The problem we tackled was to learn binary classification model from large data with accuracy constraint for the minority class. We propose a new meta-learning method that creates initial models using cost-sensitive learning by logistic regression and uses these models as initial chromosomes for genetic algorithm. The method has been successfully tested on a large real-world data set from our internet security research. Experiments prove that our method always leads to better results than usage of logistic regression or genetic algorithm alone. Moreover, this method produces easily understandable classification model.

Cite

CITATION STYLE

APA

Hlosta, M., Stríž, R., Kupčík, J., Zendulka, J., & Hruška, T. (2013). Constrained Classification of Large Imbalanced Data by Logistic Regression and Genetic Algorithm. International Journal of Machine Learning and Computing, 214–218. https://doi.org/10.7763/ijmlc.2013.v3.305

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free