Managing the synchronization in the lambda architecture for optimized big data analysis

Thomas Vanhove; Gregory Van Seghbroeck; Tim Wauters; Bruno Volckaert; Filip De Turck

Journal ArticleOPEN ACCESS

Managing the synchronization in the lambda architecture for optimized big data analysis

IEICE Transactions on Communications (2016) E99B(2) 297-306

DOI: 10.1587/transcom.2015ITI0001

7Citations

25Readers

Abstract

In a world of continuously expanding amounts of data, retrieving interesting information from enormous data sets becomes more complex every day. Solutions for precomputing views on these big data sets mostly follow either an offline approach, which is slow but can take into account the entire data set, or a streaming approach, which is fast but only relies on the latest data entries. A hybrid solution was introduced through the Lambda architecture concept. It combines both offline and streaming approaches by analyzing data in a fast speed layer first, and in a slower batch layer later. However, this introduces a new synchronization challenge: once the data is analyzed by the batch layer, the corresponding information needs to be removed in the speed layer without introducing redundancy or loss of data. In this paper we propose a new approach to implement the Lambda architecture concept independent of the technologies used for offline and stream computing. A universal solution is provided to manage the complex synchronization introduced by the Lambda architecture and techniques to provide fault tolerance. The proposed solution is evaluated by means of detailed experimental results.

Author supplied keywords

Cite

CITATION STYLE

APA

Vanhove, T., Van Seghbroeck, G., Wauters, T., Volckaert, B., & De Turck, F. (2016). Managing the synchronization in the lambda architecture for optimized big data analysis. IEICE Transactions on Communications, E99B(2), 297–306. https://doi.org/10.1587/transcom.2015ITI0001

Managing the synchronization in the lambda architecture for optimized big data analysis

Abstract

Author supplied keywords

Cite

Register to see more suggestions