-
Book Overview & Buying
-
Table Of Contents
Engineering Lakehouses with Open Table Formats
By :
In this section, we will explore the defining attributes that give open data lakehouse architecture its edge for today’s analytical workloads. These attributes form the foundation of how an open data lakehouse architecture delivers openness, reliability, and cost-efficiency, enabling running multiple analytical workloads on the same architecture. As we explore each attribute, you'll see how they come together to optimize data management, reduce costs, and ensure seamless integration across various workloads and compute engines.
One of the most important characteristics of a lakehouse architecture is its emphasis on an open data architecture. In today’s context, "openness" refers to open standards and the open way of storing data. This is facilitated by the metadata layer (table formats such as Apache Iceberg, Apache Hudi, and Delta Lake) in combination with the data layer (open file formats such...