Abstract
There is a growing awareness among life scientists of the variability in quality of the data in public repositories, and of the threat that poor data quality poses to the validity of experimental results. No standards are available, however, for computing quality levels in this data domain. We argue that data processing environments used by life scientists should feature facilities for expressing and applying quality-based, personal data acceptability criteria. We propose a framework for the specification of users’ quality processing requirements, called quality views. These views are compiled and semi-automatically embedded within the data processing environment. The result is a quality management toolkit that promotes rapid prototyping and reuse of quality components. We illustrate the utility of the framework by showing how it can be deployed within Taverna, a scientific workflow management tool, and applied to actual workflows for data analysis in proteomics.
Cite
CITATION STYLE
Missier, P., Embury, S., Greenwood, M., Preece, A., & Jin, B. (2006). Quality Views: Capturing and Exploiting the User Perspective on Data Quality. In VLDB 2006 - Proceedings of the 32nd International Conference on Very Large Data Bases (pp. 977–988). Association for Computing Machinery.
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.