Missing data in spatiotemporal datasets: the UK rainfall chemistry network

3Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Rainfall chemistry networks inevitably report some missing data, caused by contamination or loss of samples. However, there are no universally accepted rules about how such data, particularly from samples contaminated in the field, are identified and reported, leading to uncertainties in data usage by third parties, and possible incorrect inferences based on the reported data. This paper describes how the UK rainfall chemistry network data have been analysed for contamination, and how missing values can be estimated based on cross-correlations in time and space, using data from 20 sites over 26 years. The final flagged dataset is available through the CEH Environmental Information Data Centre (EIDC). Erroneous data values are identified through consideration of ion balance (internal consistency), and evidence of contamination by birds or windblown dust based on the reported chemical analysis. Overall data capture with the erroneous data excluded and no replacement of missing data was 86%, but with much smaller data capture at some sites in some years, to <30% in some cases. The use of estimated data to replace missing values resulted in an increase in overall data capture to 96%, with only one site having data capture <70% in an individual year, and all sites achieving a data capture of 88% or more over the full period. The implications of using the reported ‘official’ annual data, as opposed to the dataset with missing values replaced by estimates, are illustrated by consideration of the temporal trend in nitrate at one site, which shows twice the value in the ‘official’ reported annual data compared with the ‘estimated’ data, part of a consistent pattern across all sites. Use of the uncorrected ‘raw’ sample data leads to large errors.

References Powered by Scopus

A global assessment of precipitation chemistry and deposition of sulfur, nitrogen, sea salt, base cations, organic acids, acidity and pH, and phosphorus

652Citations
N/AReaders
Get full text

Introduction to the European Monitoring and Evaluation Programme (EMEP) and observed atmospheric composition change during 1972-2009

537Citations
N/AReaders
Get full text

Calibration of collection procedures for the determination of precipitation chemistry

146Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Throughfall and bulk deposition of dissolved organic nitrogen to holm oak forests in the Iberian Peninsula: Flux estimation and identification of potential sources

38Citations
N/AReaders
Get full text

Self-organizing maps for exploration of partially observed data and imputation of missing values

7Citations
N/AReaders
Get full text

Imputation of missing microclimate data of coffee-pine agroforestry with machine learning

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Cape, J. N., Smith, R. I., & Leaver, D. (2015). Missing data in spatiotemporal datasets: the UK rainfall chemistry network. Geoscience Data Journal, 2(1), 25–30. https://doi.org/10.1002/gdj3.24

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

44%

Lecturer / Post doc 3

33%

Researcher 2

22%

Readers' Discipline

Tooltip

Computer Science 2

29%

Earth and Planetary Sciences 2

29%

Environmental Science 2

29%

Sports and Recreations 1

14%

Save time finding and organizing research with Mendeley

Sign up for free