A fast data preprocessing procedure for support vector regression

Hao Zhifeng; Wen Wen; Yang Xiaowei; Lu Jie; Zhang Guangquan

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4224 LNCS 48-56

DOI: 10.1007/11875581_6

Abstract

This paper proposes a fast data preprocessing procedure (FDPP) for support vector regression (SVR). In the presented method, the dataset is first divided into several subsets, and K-means clustering is then run on each subset. The resulting clusters are classified by group size: centroids of small groups are eliminated, and the remaining centroids are used for SVR training. The relationship between group size and noisy clusters is discussed, and simulations are given. The results show that FDPP removes most of the noise, preserves the useful statistical information, and reduces the number of training samples. Most importantly, FDPP runs very fast and maintains the good regression performance of SVR. © Springer-Verlag Berlin Heidelberg 2006.
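
The steps named in the abstract (split, cluster each subset, drop small clusters, keep the remaining centroids) can be sketched roughly as follows. This is an illustrative reading of the abstract only, not the authors' implementation. The function names `kmeans` and `fdpp_sketch`, the parameters `n_subsets`, `k`, and `min_size`, the split into contiguous chunks after sorting, and the choice to cluster joint (input, target) tuples are all assumptions that the abstract does not specify.

```python
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means on a list of equal-length tuples.

    Returns (centroids, sizes), where sizes[i] is the number of points
    in the group whose mean is centroids[i].
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    groups = [[] for _ in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its nearest centroid (squared Euclidean).
            j = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            groups[j].append(p)
        # Recompute each centroid as its group mean; an empty group keeps
        # its previous centroid.
        centroids = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else centroids[i]
            for i, g in enumerate(groups)
        ]
    return centroids, [len(g) for g in groups]


def fdpp_sketch(samples, n_subsets=4, k=5, min_size=3, seed=0):
    """Hypothetical FDPP-style reduction of (x, y) samples.

    Assumptions (not stated in the abstract): subsets are contiguous
    chunks of the sorted samples, and a cluster counts as "small" when
    it holds fewer than min_size points.
    """
    samples = sorted(samples)
    chunk = -(-len(samples) // n_subsets)  # ceiling division
    kept = []
    for start in range(0, len(samples), chunk):
        subset = samples[start:start + chunk]
        centroids, sizes = kmeans(subset, min(k, len(subset)), seed=seed)
        # Discard centroids of small groups, keep the rest.
        kept.extend(c for c, n in zip(centroids, sizes) if n >= min_size)
    return kept


# Toy data: a noiseless line plus one gross outlier.
data = [(i / 10, 2 * i / 10) for i in range(100)] + [(5.0, 50.0)]
reduced = fdpp_sketch(data)
print(len(data), "samples reduced to", len(reduced), "centroids")
```

The surviving centroids would then serve as the training set for any SVR implementation. Because each subset is clustered independently, each K-means run sees only a fraction of the data, which is one plausible reason such a procedure could run quickly.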

Cite

Zhifeng, H., Wen, W., Xiaowei, Y., Jie, L., & Guangquan, Z. (2006). A fast data preprocessing procedure for support vector regression. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4224 LNCS, pp. 48–56). Springer Verlag. https://doi.org/10.1007/11875581_6