MaNIACS: Approximate Mining of Frequent Subgraph Patterns through Sampling

12Citations
Citations of this article
19Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We present MaNIACS, a sampling-based randomized algorithm for computing high-quality approximations of the collection of the subgraph patterns that are frequent in a single, large, vertex-labeled graph, according to the Minimum Node Image-based (MNI) frequency measure. The output of MaNIACS comes with strong probabilistic guarantees, obtained by using the empirical Vapnik-Chervonenkis (VC) dimension, a key concept from statistical learning theory, together with strong probabilistic tail bounds on the difference between the frequency of a pattern in the sample and its exact frequency. MaNIACS leverages properties of the MNI-frequency to aggressively prune the pattern search space, and thus to reduce the time spent in exploring subspaces containing no frequent patterns. In turn, this pruning leads to better bounds to the maximum frequency estimation error, which leads to increased pruning, resulting in a beneficial feedback effect. The results of our experimental evaluation of MaNIACS on real graphs show that it returns high-quality collections of frequent patterns in large graphs up to two orders of magnitude faster than the exact algorithm.

Cite

CITATION STYLE

APA

Preti, G., De Francisci Morales, G., & Riondato, M. (2021). MaNIACS: Approximate Mining of Frequent Subgraph Patterns through Sampling. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1348–1358). Association for Computing Machinery. https://doi.org/10.1145/3447548.3467344

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free