Imputation methods for missing data for a proposed VASA dataset

4Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Preprocessing is the presentation of raw data before apply the actual statistical method. Data preprocessing is one of the most vital steps in data mining process and it deals with the preparation and transformation of the initial dataset. It is prominent because the investigating data which is not properly preprocessed could lead to the result which is not accurate and meaningless. Almost every research have missing data and introduce an element into data analysis using some method. To consider the missing values that need to provide an efficient and valid analysis. Missing imputation is one of the process in data cleaning. Here, four different types of imputation methods are compared: Mean, Singular Value Decomposition (SVD), K-Nearest Neighbors (KNN), Bayesian Principal Component Analysis (BPCA). Comparison was performed in the real VASA dataset and based on performance evaluation criteria such as Mean Square Error (MSE) and Root Mean Square Error (RMSE). BPCA is the best imputation method of interest which deserve further consideration in practice.

Cite

CITATION STYLE

APA

Anitha, S., & Vanitha, M. (2019). Imputation methods for missing data for a proposed VASA dataset. International Journal of Innovative Technology and Exploring Engineering, 9(1), 1950–1953. https://doi.org/10.35940/ijitee.A5204119119

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free