Multistage adaptive load balancing for big active data publish subscribe systems

8Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we address issues in the design and operation of a Big Active Data Publish Subscribe (BAD Pub/Sub) systems to enable the next generation of enriched notification systems that can scale to societal levels. The proposed BAD Pub/Sub system will aim to ingest massive amounts of data from heterogeneous publishers and sources and deliver customized, enriched notifications to end users (subscribers) that express interests in these data items via parameterized channels. To support scalability, we employ a hierarchical architecture that combines a back-end big data cluster (to receive publications and data feeds, store data and process subscriptions) with a client-facing distributed broker network that manages user subscriptions and scales the delivery process. A key aspect of our broker capacity is its ability to aggregate subscriptions from end users to immensely reduce the end to end overheads and loads. The skewed distribution of subscribers, their interests and the dynamic nature of societal scale publications, create load imbalance in the distributed broker network. We mathematically formulate the notion of broker load in this setting and derive an optimization problem to minimize the maximum load (an NP-hard problem). We propose a staged approach for broker load balancing that executes in multiple stages - - initial placement of brokers to subscribers, dynamic subscriber migration during operation to handle transient and instantaneous loads and occasional shuffles to re-stabilize the system. We develop a prototype implementation of our staged load balancing on a real BAD Pub/Sub testbed (multinode cluster) with a distributed broker network and conduct experiments using real world workloads. We further evaluate our schemes via a detailed simulation studies.

References Powered by Scopus

Chord: A scalable peer-to-peer lookup service for internet applications

4578Citations
N/AReaders
Get full text

Pastry: Scalable, decentralized object location, and routing for large-scale peer-to-peer systems

3512Citations
N/AReaders
Get full text

Stochastic models of load balancing and scheduling in cloud computing clusters

292Citations
N/AReaders
Get full text

Cited by Powered by Scopus

DisGB: Using geo-context information for efficient routing in geo-distributed pub/sub systems

12Citations
N/AReaders
Get full text

Comat: An effective composite matching framework for content-based pub/sub systems

7Citations
N/AReaders
Get full text

Subscribing to big data at scale

4Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Nguyen, H., Uddin, M. Y. S., & Venkatasubramanian, N. (2019). Multistage adaptive load balancing for big active data publish subscribe systems. In DEBS 2019 - Proceedings of the 13th ACM International Conference on Distributed and Event-Based Systems (pp. 43–54). Association for Computing Machinery, Inc. https://doi.org/10.1145/3328905.3329508

Readers over time

‘19‘20‘21‘22‘23‘2400.751.52.253

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 4

57%

Professor / Associate Prof. 2

29%

Lecturer / Post doc 1

14%

Readers' Discipline

Tooltip

Computer Science 6

75%

Decision Sciences 1

13%

Engineering 1

13%

Save time finding and organizing research with Mendeley

Sign up for free
0