The big data topic will be one of the leading growth markets in information technology in the next years. One problem in this area is the efficient computation of huge data volumes, especially for complex algorithms in data mining and machine learning tasks. This paper discuss new processing frameworks for big and smart data in distributed environments and presents a benchmark between two frameworks - Apache Flink and Apache Spark - based on a mixed workload with algorithms from different analytic areas with different real-world datasets.
CITATION STYLE
Spangenberg, N., Roth, M., & Franczyk, B. (2015). Evaluating new approaches of big data analytics frameworks. In Lecture Notes in Business Information Processing (Vol. 208, pp. 28–37). Springer Verlag. https://doi.org/10.1007/978-3-319-19027-3_3
Mendeley helps you to discover research relevant for your work.