Combine vector quantization and support vector machine for imbalanced datasets

8Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

In cases of extremely imbalanced dataset with high dimensions, standard machine learning techniques tend to be overwhelmed by the large classes. This paper rebalances skewed datasets by compressing the majority class. This approach combines Vector Quantization and Support Vector Machine and constructs a new approach, VQ-SVM, to rebalance datasets without significant information loss. Some issues, e.g. distortion and support vectors, have been discussed to address the trade-off between the information loss and undersampling. Experiments compare VQ-SVM and standard SVM on some imbalanced datasets with varied imbalance ratios, and results show that the performance of VQ-SVM is superior to SVM, especially in case of extremely imbalanced large datasets. © 2006 International Federation for Information Processing.

Cite

CITATION STYLE

APA

Yu, T., Debenham, J., Jan, T., & Simoff, S. (2006). Combine vector quantization and support vector machine for imbalanced datasets. IFIP International Federation for Information Processing, 217, 81–88. https://doi.org/10.1007/978-0-387-34747-9_9

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free