Mining conditional functional dependency rules on big data

44Citations
Citations of this article
85Readers
Mendeley users who have this article in their library.

Abstract

Current Conditional Functional Dependency (CFD) discovery algorithms always need a well-prepared training dataset. This condition makes them difficult to apply on large and low-quality datasets. To handle the volume issue of big data, we develop the sampling algorithms to obtain a small representative training set. We design the fault-tolerant rule discovery and conflict-resolution algorithms to address the low-quality issue of big data. We also propose parameter selection strategy to ensure the effectiveness of CFD discovery algorithms. Experimental results demonstrate that our method can discover effective CFD rules on billion-tuple data within a reasonable period.

Cite

CITATION STYLE

APA

Li, M., Wang, H., & Li, J. (2020). Mining conditional functional dependency rules on big data. Big Data Mining and Analytics, 3(1), 68–84. https://doi.org/10.26599/BDMA.2019.9020019

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free