ALOJA: A benchmarking and predictive platform for big data performance analysis

0Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The main goals of the ALOJA research project from BSC-MSR, are to explore and automate the characterization of costeffectiveness of Big Data deployments. The development of the project over its first year, has resulted in a open source benchmarking platform, an online public repository of results with over 42,000 Hadoop job runs, and web-based analytic tools to gather insights about system’s costperformance (ALOJA’s Web application, tools, and sources available at http://aloja.bsc.es). This article describes the evolution of the project’s focus and research lines from over a year of continuously benchmarking Hadoop under different configuration and deployments options, presents results, and discusses the motivation both technical and market-based of such changes. During this time, ALOJA’s target has evolved from a previous low-level profiling of Hadoop runtime, passing through extensive benchmarking and evaluation of a large body of results via aggregation, to currently leveraging Predictive Analytics (PA) techniques. Modeling benchmark executions allow us to estimate the results of new or untested configurations or hardware set-ups automatically, by learning techniques from past observations saving in benchmarking time and costs.

Cite

CITATION STYLE

APA

Poggi, N., Berral, J. L., & Carrera, D. (2016). ALOJA: A benchmarking and predictive platform for big data performance analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10044, pp. 71–84). Springer Verlag. https://doi.org/10.1007/978-3-319-49748-8_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free