Ubiq: A Scalable and Fault-Tolerant Log Processing Infrastructure

Abstract

Most of today’s Internet applications generate vast amounts of data (typically, in the form of event logs) that needs to be processed and analyzed for detailed reporting, enhancing user experience and increasing monetization. In this paper, we describe the architecture of Ubiq, a geographically distributed framework for processing continuously growing log files in real time with high scalability, high availability and low latency. The Ubiq framework fully tolerates infrastructure degradation and data center-level outages without any manual intervention. It also guarantees exactly-once semantics for application pipelines to process logs as a collection of multiple events. Ubiq has been in production for Google’s advertising system for many years and has served as a critical log processing framework for several dozen pipelines. Our production deployment demonstrates linear scalability with machine resources, extremely high availability even with underlying infrastructure failures, and an end-to-end latency of under a minute.

Cite

Basker, V., Bhatia, M., Ganeshan, V., Gupta, A., He, S., Holzer, S., … Agrawal, D. (2019). Ubiq: A Scalable and Fault-Tolerant Log Processing Infrastructure. In Lecture Notes in Business Information Processing (Vol. 337, pp. 155–174). Springer. https://doi.org/10.1007/978-3-030-24124-7_10
