In 2004, Google introduced the MapReduce framework, a simple yet powerful programming model that enables the easy development of scalable parallel applications for processing vast amounts of data on large clusters of commodity machines (Dean and Ghemawat, OSDI, 2004, [20]). In particular, the implementation described in the original paper is designed mainly to achieve high performance on large clusters of commodity PCs. One of the main advantages of this approach is that it isolates the application from the details of running a distributed program, such as data distribution, scheduling, and fault tolerance. In this model, the computation takes a set of key-value pairs as input and produces a set of key-value pairs as output.
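The key-value programming model described above can be sketched as a minimal single-process simulation. This is a hypothetical illustration, not Google's implementation: the names `map_reduce`, `map_words`, and `reduce_counts` are assumptions introduced here, and the real framework distributes the map, shuffle, and reduce phases across a cluster.

```python
from collections import defaultdict

def map_reduce(records, map_fn, reduce_fn):
    """Hypothetical single-process sketch of the MapReduce model:
    map_fn turns each input record into (key, value) pairs, the
    framework groups values by key (the shuffle phase), and
    reduce_fn folds each group into a single output value."""
    groups = defaultdict(list)
    for record in records:
        for key, value in map_fn(record):
            groups[key].append(value)  # group intermediate values by key
    return {key: reduce_fn(key, values) for key, values in groups.items()}

# Word count, the canonical example from the original paper:
def map_words(line):
    for word in line.split():
        yield word, 1  # emit one (word, 1) pair per occurrence

def reduce_counts(word, counts):
    return sum(counts)  # total occurrences of each word

counts = map_reduce(["the quick fox", "the fox"], map_words, reduce_counts)
# counts == {"the": 2, "quick": 1, "fox": 2}
```

Because the application supplies only the two pure functions, the framework is free to partition the input, schedule the map and reduce tasks, and re-execute failed tasks without the application's involvement.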
Sakr, S. (2016). General-Purpose Big Data Processing Systems. In SpringerBriefs in Computer Science (Vol. 0, pp. 15–39). Springer. https://doi.org/10.1007/978-3-319-38776-5_2