Analysis of data quality issues in real-world industrial data

12Citations
Citations of this article
28Readers
Mendeley users who have this article in their library.

Abstract

In large industries usage of advanced technological methods and modern equipment comes with the problem of storing, interpreting and analyzing huge amount of information. Handling information becomes more complicated and important at the same time. So, data quality is one of major challenges considering a rapid growth of information, fragmentation of information systems, incorrect data formatting and other issues. The aim of this paper is to describe industrial data processing and analytics on the real-world use case. The most crucial data quality issues are described, examined and classified in terms of Data Quality Dimensions. Factual industrial information supports and illustrates each encountered data deficiency. In addition, we describe methods for elimination data quality issues and data analysis techniques, which are applied after cleaning data procedure. In addition, an approach to address data quality problems in large-scale industrial datasets is proposed. This techniques and methods comprise several well-known techniques, which come from both worlds of mathematical logic and also statistics, improving data quality procedure and cleaning results.

Cite

CITATION STYLE

APA

Hubauer, T., Lamparter, S., Roshchin, M., Solomakhina, N., & Watson, S. (2013). Analysis of data quality issues in real-world industrial data. In PHM 2013 - Proceedings of the Annual Conference of the Prognostics and Health Management Society 2013 (pp. 685–693). Prognostics and Health Management Society. https://doi.org/10.36001/phmconf.2013.v5i1.2198

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free