Kudu: Storage for Fast Analytics on Fast Data

Todd Lipcon; David Alves; Dan Burkert; Jean-Daniel Cryans; Adar Dembo; Mike Percy; Silvius Rus; Dave Wang; Matteo Bertozzi; Colin Patrick McCabe; Andrew Wang

Journal Article

Draft (2015)

Abstract

Kudu is an open source storage engine for structured data which supports low-latency random access together with efficient analytical access patterns. Kudu distributes data using horizontal partitioning and replicates each partition using Raft consensus, providing low mean-time-to-recovery and low tail latencies. Kudu is designed within the context of the Hadoop ecosystem and supports many modes of access via tools such as Cloudera Impala[20], Apache Spark[28], and MapReduce[17].
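
As a rough illustration of the two mechanisms the abstract names, the sketch below assigns rows to partitions by hashing a key (one common form of horizontal partitioning) and computes the majority size that a Raft group needs to commit a write. It is a minimal sketch under stated assumptions, not Kudu's implementation or API: the function names, the CRC32 hash, and the sample keys are all hypothetical.

```python
import zlib


def partition_for(key: str, num_partitions: int) -> int:
    """Map a row key to a partition index using a stable hash.

    Hypothetical helper: CRC32 is chosen only because it is deterministic
    across runs; it is not claimed to be the hash Kudu uses.
    """
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def majority(num_replicas: int) -> int:
    """Smallest number of replicas that forms a Raft majority."""
    return num_replicas // 2 + 1


def tolerated_failures(num_replicas: int) -> int:
    """Replicas that can be lost while a majority remains available."""
    return num_replicas - majority(num_replicas)


# Illustrative keys (assumed), spread over 4 partitions.
rows = ["host-001", "host-002", "host-003", "host-004"]
placement = {key: partition_for(key, 4) for key in rows}
```

With three replicas per partition, a majority is two, so a partition in this sketch stays writable after losing one replica.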

Cite

APA: Lipcon, T., Alves, D., Burkert, D., Cryans, J., Dembo, A., Percy, M., … Wang, A. (2015). Kudu: Storage for Fast Analytics on Fast Data. Draft.
