Measuring data completeness for microbial genomics database

9Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Poor quality data such as data with missing values (or records) cause negative consequences in many application domains. An important aspect of data quality is completeness. One problem in data completeness is the problem of missing individuals in data sets. Within a data set, the individuals refer to the real world entities whose information is recorded. So far, in completeness studies however, there has been little discussion about how missing individuals are assessed. In this paper, we propose the notion of population-based completeness (PBC) that deals with the missing individuals problem, with the aim of investigating what is required to measure PBC and to identify what is needed to support PBC measurements in practice. This paper explores the need of PBC in the microbial genomics where real sample data sets retrieved from a microbial database called Comprehensive Microbial Resources are used (CMR). © 2013 Springer-Verlag.

Cite

CITATION STYLE

APA

Emran, N. A., Embury, S., Missier, P., Isa, M. N. M., & Muda, A. K. (2013). Measuring data completeness for microbial genomics database. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7802 LNAI, pp. 186–195). https://doi.org/10.1007/978-3-642-36546-1_20

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free