Engineering Lakehouses with Open Table Formats
The rise of the data lakehouse architecture has redefined how organizations manage, process, and analyze data. As open standards continue to mature, modern data engineering increasingly depends on open table formats such as Apache Iceberg, Apache Hudi, and Delta Lake to bring transactional consistency, performance, and flexibility to data lakes.
Engineering Lakehouses with Open Table Formats is designed to help data engineers and architects understand, evaluate, and implement these formats in real-world environments. The book walks through the entire lakehouse journey, from table format internals and transactional capabilities to building production-ready lakehouses with tools such as Apache Spark, Flink, Kafka, Debezium, MLflow, and Python frameworks. It takes a hands-on, engineering-focused approach, with examples, architectural diagrams, and code recipes throughout.