Data interoperability is a major issue in data management for data science and big data analytics. Probabilistic data integration (PDI) is a specific kind of data integration where extraction and integration problems such as inconsistency and uncertainty are handled by means of a probabilistic data representation. This allows a data integration process with two phases: (1) a quick partial integration where data quality problems are represented as uncertainty in the resulting integrated data, and (2) using the uncertain data and continuously improving its quality as more evidence is gathered. The main contribution of this paper is an iterative approach for incorporating evidence of users in the probabilistically integrated data. Evidence can be specified as hard or soft rules (i.e., rules that are uncertain themselves).
CITATION STYLE
van Keulen, M., Kaminski, B. L., Matheja, C., & Katoen, J. P. (2018). Rule-based conditioning of probabilistic data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11142 LNAI, pp. 290–305). Springer Verlag. https://doi.org/10.1007/978-3-030-00461-3_20
Mendeley helps you to discover research relevant for your work.