Streaming balanced graph partitioning algorithms for random graphs

38Citations
Citations of this article
57Readers
Mendeley users who have this article in their library.
Get full text

Abstract

With recent advances in storage technology, it is now possible to store the vast amounts of data generated by cloud computing applications. The sheer size of 'big data' motivates the need for streaming algorithms that can compute approximate solutions without full random access to all of the data. In this paper, we consider the problem of loading a graph onto a distributed cluster with the goal of optimizing later computation. We model this as computing an approximately balanced /c-partitioning of a graph in a streaming fashion with only one pass over the data. We give lower bounds on this problem, showing that no algorithm can obtain an o{n) approximation with a random or adversarial stream ordering. We analyze two variants of a randomized greedy algorithm (looking at the distribution of edges from the vertex to be assigned, one prefers the partition that is the argmax and one that assigns the vertex proportional to the edge distribution) on random graphs with embedded balanced K-cuts and are able to theoretically bound the performance of each algorithms - The argmax algorithm is able to recover the embedded fc-cut while the proportional variant can not. This matches the experimental results in [30]. Copyright © 2014 by the Society for Industrial and Applied Mathematics.

Cite

CITATION STYLE

APA

Stanton, I. (2014). Streaming balanced graph partitioning algorithms for random graphs. In Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms (pp. 1287–1301). Association for Computing Machinery. https://doi.org/10.1137/1.9781611973402.95

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free