Building Modern Data Applications Using Databricks Lakehouse
In this chapter, we covered the various ways that data lineage can be traced across datasets in the Databricks Data Intelligence Platform. We saw how the Data Lineage REST API allowed us to quickly view the upstream and downstream connections of a particular table or column in Unity Catalog. Next, we looked at how easy it was to generate a lineage graph using the Catalog Explorer in Unity Catalog. The lineage graph was essential for understanding how changes to a dataset could impact its downstream consumers. Lastly, we looked at how the system tables in Unity Catalog provided a way for our organization to document how the relationships between data assets evolve over time.
In the next chapter, we’ll turn our attention to deploying our data pipelines and all their dependencies in an automated fashion using tools such as Terraform.