Influence of outliers on some multiple imputation methods

  • Quintano C
  • Castellano R
  • Rocca A
N/ACitations
Citations of this article
24Readers
Mendeley users who have this article in their library.

Abstract

In the field of data quality, imputation is the most used method for handling missing data. The performance of imputation techniques is influenced by various factors, especially when data represent only a sample of population, for example the survey design characteristics. In this paper, we compare the results of different multiple imputation methods in terms of final estimates when outliers occur in a dataset. Consequently, in order to evaluate the influence of outliers on the performance of these methods, the procedure is applied before and after that we have identified and removed them. For this purpose, missing data were simulated on data coming from sample ISTAT annual survey on Small and Medium Enterprises. MAR mechanism is assumed for missing data. The methods are based on the multiple imputation through the Markov Chain Monte Carlo (MCMC), the propensity score and the mixture models. The results highlight the strong influence of data characteristics on final estimates.

Cite

CITATION STYLE

APA

Quintano, C., Castellano, R., & Rocca, A. (2010). Influence of outliers on some multiple imputation methods. Advances in Methodology and Statistics, 7(1). https://doi.org/10.51936/tuki4538

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free