Episode 398: Apache Kudu with Adar Lieber-Dembo

Filed in Episodes on February 12, 2020

Adar Lieber-Dembo from Cloudera discusses Apache Kudu, a columnar data storage system for fast analytics and fast ingestion of large datasets. Kudu takes its inspiration from systems in the Hadoop ecosystem, but it addresses many of their shortcomings. SE Radio’s Akshay Manchale spoke with Adar about the motivations behind building Kudu, the features available for users to ingest and query data, and the operational aspects of running Kudu. They also talked about special features such as partitioning and distributing data in a Kudu cluster, features for high availability, HybridTime, and the integration of Kudu with other data analysis and SQL engines. The interview ends with a brief discussion of the advantages of column-based storage in databases.

SE Radio theme: “Broken Reality” by Kevin MacLeod (incompetech.com — Licensed under Creative Commons: By Attribution 3.0)
