Building Modern Data Applications Using Databricks Lakehouse
To follow along with this chapter, you should have Databricks workspace permissions to create an all-purpose cluster and a DLT pipeline using a cluster policy. You should also have Unity Catalog permissions to create and use catalogs, schemas, and tables. All code samples can be downloaded from this chapter’s GitHub repository at https://github.com/PacktPublishing/Building-Modern-Data-Applications-Using-Databricks-Lakehouse/tree/main/chapter03. We’ll be using the NYC yellow taxi dataset, which is available on the Databricks File System (DBFS) at /databricks-datasets/nyctaxi/tripdata/yellow. In this chapter, you will create and run several new notebooks and DLT pipelines using the Advanced product edition. As a result, the pipelines are estimated to consume around 10 to 20 Databricks Units (DBUs).
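As a rough illustration of how these requirements fit together, the following sketch assembles a pipeline settings dictionary of the kind a DLT pipeline definition uses: the Advanced edition, a Unity Catalog catalog and target schema, and a cluster policy. It is only a sketch under stated assumptions, not the chapter's own code. The pipeline name, notebook path, catalog, schema, and policy ID are hypothetical placeholders, and the `source_path` configuration key is an illustrative name rather than a built-in setting.

```python
import json

# Dataset location quoted in the chapter text.
YELLOW_TAXI_PATH = "/databricks-datasets/nyctaxi/tripdata/yellow"


def build_pipeline_settings(name, notebook_path, catalog, schema, policy_id):
    """Build a DLT pipeline settings dict. All argument values are placeholders."""
    return {
        "name": name,
        "edition": "ADVANCED",   # product edition used in this chapter
        "catalog": catalog,      # Unity Catalog catalog to publish to
        "target": schema,        # schema inside that catalog
        "development": True,     # assumption: development mode while learning
        "continuous": False,     # assumption: triggered runs to limit DBU usage
        "libraries": [{"notebook": {"path": notebook_path}}],
        "clusters": [{"label": "default", "policy_id": policy_id}],
        # Hypothetical key that a notebook could read to locate the dataset.
        "configuration": {"source_path": YELLOW_TAXI_PATH},
    }


settings = build_pipeline_settings(
    name="chapter03-example-pipeline",              # hypothetical
    notebook_path="/Users/someone@example.com/ch3",  # hypothetical
    catalog="my_catalog",                           # hypothetical
    schema="my_schema",                             # hypothetical
    policy_id="0000000000000000",                   # hypothetical
)
print(json.dumps(settings, indent=2))
```

The printed JSON could be compared against the settings shown in the pipeline's JSON view in the workspace. Substitute the values that match your own workspace before using any of it.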