Imputation methods for missing data for a proposed VASA dataset

S. Anitha; M. Vanitha

Journal Article

International Journal of Innovative Technology and Exploring Engineering (2019) 9(1) 1950-1953

DOI: 10.35940/ijitee.A5204119119

Abstract

Preprocessing is the preparation of raw data before applying the actual statistical method. Data preprocessing is one of the most vital steps in the data mining process, and it deals with the preparation and transformation of the initial dataset. It matters because analyzing data that has not been properly preprocessed can lead to inaccurate and meaningless results. Almost every research study has missing data, which must be handled by some method to provide an efficient and valid analysis. Imputation of missing values is one of the processes in data cleaning. Here, four imputation methods are compared: Mean, Singular Value Decomposition (SVD), K-Nearest Neighbors (KNN), and Bayesian Principal Component Analysis (BPCA). The comparison was performed on the real VASA dataset using performance evaluation criteria such as Mean Square Error (MSE) and Root Mean Square Error (RMSE). Among the methods compared, BPCA performed best and deserves further consideration in practice.
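The kind of comparison the abstract describes can be illustrated with a minimal sketch. It is not the authors' implementation and does not use the VASA dataset: the data are synthetic, the 10% missing rate is an assumption, and the function names (`mean_impute`, `knn_impute`, `mse`, `rmse`) are hypothetical. Only mean and a simple KNN imputation are shown; SVD and BPCA are omitted. Known values are hidden, imputed, and then scored against the hidden truth.

```python
import numpy as np


def mean_impute(X):
    """Replace each NaN with the mean of the observed values in its column."""
    filled = X.copy()
    col_means = np.nanmean(X, axis=0)
    rows, cols = np.where(np.isnan(X))
    filled[rows, cols] = col_means[cols]
    return filled


def knn_impute(X, k=3):
    """Replace each NaN with the average of that column over the k nearest
    rows that observe it. Distance is the RMS difference over the columns
    both rows observe (a simplifying assumption of this sketch)."""
    filled = X.copy()
    missing = np.isnan(X)
    n_rows = X.shape[0]
    for i in np.where(missing.any(axis=1))[0]:
        dists = np.full(n_rows, np.inf)
        for j in range(n_rows):
            if j == i:
                continue
            shared = ~missing[i] & ~missing[j]
            if shared.any():
                diff = X[i, shared] - X[j, shared]
                dists[j] = np.sqrt(np.mean(diff ** 2))
        order = np.argsort(dists)
        for c in np.where(missing[i])[0]:
            donors = [j for j in order
                      if np.isfinite(dists[j]) and not missing[j, c]][:k]
            # Fall back to the column mean if no usable neighbour exists.
            filled[i, c] = X[donors, c].mean() if donors else np.nanmean(X[:, c])
    return filled


def mse(truth, imputed, mask):
    """Mean square error over the entries that were hidden."""
    return float(np.mean((truth[mask] - imputed[mask]) ** 2))


def rmse(truth, imputed, mask):
    """Root mean square error over the entries that were hidden."""
    return float(np.sqrt(mse(truth, imputed, mask)))


# Synthetic, correlated data (an assumption; stands in for a real dataset).
rng = np.random.default_rng(0)
base = rng.normal(size=(50, 1))
X_true = base * np.array([1.0, 2.0, -1.0, 0.5]) + 0.1 * rng.normal(size=(50, 4))

# Hide roughly 10% of the entries at random.
mask = rng.random(X_true.shape) < 0.1
X_missing = X_true.copy()
X_missing[mask] = np.nan

results = {}
for name, method in [("mean", mean_impute), ("knn", knn_impute)]:
    X_hat = method(X_missing)
    results[name] = (mse(X_true, X_hat, mask), rmse(X_true, X_hat, mask))
    print(f"{name}: MSE={results[name][0]:.4f} RMSE={results[name][1]:.4f}")
```

Lower MSE and RMSE indicate imputed values closer to the hidden truth. The ranking on synthetic data like this says nothing about the ranking reported for the VASA dataset.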

Citation (APA)

Anitha, S., & Vanitha, M. (2019). Imputation methods for missing data for a proposed VASA dataset. International Journal of Innovative Technology and Exploring Engineering, 9(1), 1950–1953. https://doi.org/10.35940/ijitee.A5204119119
