Graph sampling with applications to estimating the number of pattern embeddings and the parameters of a statistical relational model

Irma Ravkic; Martin Žnidaršič; Jan Ramon; Jesse Davis

Journal Article

Graph sampling with applications to estimating the number of pattern embeddings and the parameters of a statistical relational model

Data Mining and Knowledge Discovery (2018) 32(4) 913-948

DOI: 10.1007/s10618-018-0553-2

2Citations

15Readers

Get full text

Abstract

Counting the number of times a pattern occurs in a database is a fundamental data mining problem. It is a subroutine in a diverse set of tasks ranging from pattern mining to supervised learning and probabilistic model learning. While a pattern and a database can take many forms, this paper focuses on the case where both the pattern and the database are graphs (networks). Unfortunately, in general, the problem of counting graph occurrences is #P-complete. In contrast to earlier work, which focused on exact counting for simple (i.e., very short) patterns, we present a sampling approach for estimating the statistics of larger graph pattern occurrences. We perform an empirical evaluation on synthetic and real-world data that validates the proposed algorithm, illustrates its practical behavior and provides insight into the trade-off between its accuracy of estimation and computational efficiency.

Author supplied keywords

Cite

CITATION STYLE

APA

Ravkic, I., Žnidaršič, M., Ramon, J., & Davis, J. (2018). Graph sampling with applications to estimating the number of pattern embeddings and the parameters of a statistical relational model. Data Mining and Knowledge Discovery, 32(4), 913–948. https://doi.org/10.1007/s10618-018-0553-2

Graph sampling with applications to estimating the number of pattern embeddings and the parameters of a statistical relational model

Abstract

Author supplied keywords

Cite

Register to see more suggestions