High Performance Data Processing with Spark and Kudu

  • Quinto B


Kudu is just a storage engine, so you need a way to get data into and out of it. As Cloudera’s default big data processing framework, Spark is the ideal data processing and ingestion tool for Kudu. Not only does Spark provide excellent scalability and performance, but Spark SQL and the DataFrame API also make it easy to interact with Kudu.




Quinto, B. (2018). High Performance Data Processing with Spark and Kudu. In Next-Generation Big Data (pp. 159–229). Apress.
