Engineering Lakehouses with Open Table Formats
While Apache Iceberg, Apache Hudi, and Delta Lake share core capabilities such as ACID transactions, schema evolution, and time travel, their architectural designs and optimization strategies cater to distinct use cases. Selecting the right table format is a foundational decision in building a scalable and efficient lakehouse architecture. This choice extends far beyond storage: it dictates how data is ingested, updated, and queried, and it shapes the broader ecosystem of compute engines, metadata catalogs, and downstream applications such as BI tools and ML platforms.
This chapter provides a systematic approach to evaluating and selecting the most appropriate table format for a given workload type and size. It brings together the core strengths of Iceberg, Hudi, and Delta Lake (discussed in Chapters 3, 4, and 5), their integration with different ecosystems, and real-world adoption patterns that illustrate...