Kudu : Storage for Fast Analytics on Fast Data ∗

  • Lipcon T
  • Alves D
  • Burkert D
 et al. 
  • 46

    Readers

    Mendeley users who have this article in their library.
  • N/A

    Citations

    Citations of this article.

Abstract

Kudu is an open source storage engine for structured data which supports low-latency random access together with ef-ficient analytical access patterns. Kudu distributes data us-ing horizontal partitioning and replicates each partition us-ing Raft consensus, providing low mean-time-to-recovery and low tail latencies. Kudu is designed within the context of the Hadoop ecosystem and supports many modes of access via tools such as Cloudera Impala[20], Apache Spark[28], and MapReduce[17].

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document

There are no full text links

Authors

  • Todd Lipcon

  • David Alves

  • Dan Burkert

  • Jean-daniel Cryans

  • Adar Dembo

  • Mike Percy

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free