MinMax Sampling: A Near-optimal Global Summary for Aggregation in the Wide Area

10Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Nowadays, wide-area data analyses are pervasive with emerging geo-distributed systems. These analyses often need to do the global aggregation in the wide area. Since scarce and variable WAN bandwidth may degrade the aggregation performance, it is highly desired to design a communication scheme for global aggregation in WAN. Unfortunately, no existing algorithm can meet the three design requirements of communication schemes: fast computation, adaptive transmission, and accurate aggregation. In this paper, we propose MinMax Sampling, a fast, adaptive, and accurate communication scheme for global aggregation in WAN. We first focus on the accuracy and design a scheme, namely MinMaxopt, to achieve optimal accuracy. However, MinMaxopt does not meet the other two requirements: fast computation and adaptive transmission. Based on MinMaxopt, we propose MinMaxadp, which trades little accuracy for the other two requirements. We evaluate MinMaxadp with three applications: federated learning, distributed state aggregation, and hierarchical aggregation. Our experimental results show that MinMaxadp is superior to existing algorithms (8.44× better accuracy on average) in all three applications. The source codes of MinMax Sampling are available at Github [1].

Cite

CITATION STYLE

APA

Zhao, Y., Zhang, Y., Li, Y., Zhou, Y., Chen, C., Yang, T., & Cui, B. (2022). MinMax Sampling: A Near-optimal Global Summary for Aggregation in the Wide Area. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 744–758). Association for Computing Machinery. https://doi.org/10.1145/3514221.3526160

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free