In this article, we present the specification of BigBench, an end-to-end big data benchmark proposal. BigBench models a retail product supplier. The benchmark proposal covers a data model and a set of big data specific queries. BigBench's synthetic data generator addresses the variety, velocity and volume aspects of big data workloads. The structured part of the BigBench data model is adopted from the TPC-DS benchmark. In addition, the structured schema is enriched with semi-structured and unstructured data components that are common in a retail product supplier environment. This specification contains the full query set as well as the data model. © 2014 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Rabl, T., Ghazal, A., Hu, M., Crolotte, A., Raab, F., Poess, M., & Jacobsen, H. A. (2014). BigBench specification V0.1 BigBench: An industry standard benchmark for big data analytics. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8163 LNCS, pp. 164–201). Springer Verlag. https://doi.org/10.1007/978-3-642-53974-9_14
Mendeley helps you to discover research relevant for your work.