Spark is the next-generation big data processing framework for processing and analyzing large data sets. Spark features a unified processing framework that provides high-level APIs in Scala, Python, Java, and R and powerful libraries including Spark SQL for SQL support, MLlib for machine learning, Spark Streaming for real-time streaming, and GraphX for graph processing.i Spark was founded by Matei Zaharia at the University of California, Berkeley’s AMPLab and was later donated to the Apache Software Foundation, becoming a top-level project in February 24, 2014.ii The first version was released on May 30, 2017.iii
CITATION STYLE
Quinto, B. (2018). Introduction to Spark. In Next-Generation Big Data (pp. 113–158). Apress. https://doi.org/10.1007/978-1-4842-3147-0_5
Mendeley helps you to discover research relevant for your work.