Driving Data Quality with Data Contracts

By: Andrew Jones

Overview of this book

Despite the passage of time and the evolution of technology and architecture, the challenges we face in building data platforms persist. Our data often remains unreliable, is not trusted, and fails to deliver the promised value. With Driving Data Quality with Data Contracts, you’ll discover the potential of data contracts to transform how you build your data platforms, finally overcoming these enduring problems. You’ll learn how establishing contracts as the interface allows you to explicitly assign responsibility and accountability for the data to those who know it best, the data generators, and give them the autonomy to generate and manage data as required. The book will show you how data contracts ensure that consumers get quality data with clearly defined expectations, enabling them to build on that data with confidence to deliver valuable analytics, performant ML models, and trusted data-driven products. By the end of this book, you’ll have gained a comprehensive understanding of how data contracts can revolutionize your organization’s data culture and provide a competitive advantage by unlocking the real value within your data.
Table of Contents (16 chapters)
Part 1: Why Data Contracts?
Part 2: Driving Data Culture Change with Data Contracts
Part 3: Designing and Implementing a Data Architecture Based on Data Contracts

Providing the interfaces to the data

In this section, we’ll use the data contract to provision a Google BigQuery table. This will act as the interface to the data, through which the data generators will make their data available to the data consumers. We’ll learn how to use the contract and its schema to dynamically provision and manage those resources, keeping them in sync with the data contract.

This BigQuery table is the first of our contract-driven resources. To create it, we’ll need to convert our data contract to a custom JSON format that defines a BigQuery table and its schema (https://cloud.google.com/bigquery/docs/reference/rest/v2/tables#TableSchema), as highlighted in the following diagram:

Figure 8.2 – Using the data contract to define and create a BigQuery table
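
The conversion from contract to BigQuery schema can be sketched as follows. This is a minimal illustration, not the book's implementation: the `contract` dictionary, its field layout, the `TYPE_MAP` table, and the `contract_to_bigquery_schema` function are all hypothetical. Only the output shape (a `fields` list whose entries carry `name`, `type`, `mode`, and `description`) follows the TableSchema reference linked above.

```python
import json

# Hypothetical data contract. The layout and key names are assumptions
# made for this sketch, not the contract format used in the book.
contract = {
    "name": "customer",
    "fields": {
        "id": {
            "type": "string",
            "required": True,
            "description": "Unique customer identifier",
        },
        "email": {"type": "string"},
        "created_at": {"type": "timestamp", "required": True},
        "lifetime_value": {"type": "number"},
    },
}

# Assumed mapping from contract types to BigQuery column types.
TYPE_MAP = {
    "string": "STRING",
    "integer": "INTEGER",
    "number": "FLOAT",
    "boolean": "BOOLEAN",
    "timestamp": "TIMESTAMP",
    "date": "DATE",
}


def contract_to_bigquery_schema(contract: dict) -> dict:
    """Build a BigQuery TableSchema-shaped dict from the contract's fields."""
    fields = []
    for name, spec in contract["fields"].items():
        contract_type = spec["type"]
        if contract_type not in TYPE_MAP:
            raise ValueError(f"Unsupported contract type: {contract_type!r}")
        field = {
            "name": name,
            "type": TYPE_MAP[contract_type],
            # Fields are nullable unless the contract marks them required.
            "mode": "REQUIRED" if spec.get("required", False) else "NULLABLE",
        }
        if "description" in spec:
            field["description"] = spec["description"]
        fields.append(field)
    return {"fields": fields}


print(json.dumps(contract_to_bigquery_schema(contract), indent=2))
```

Because the schema is derived from the contract on every run, a change to the contract flows through to the table definition instead of being maintained by hand in two places.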

We’ll also need a way to send that JSON to the Google Cloud APIs, which will then create the table. To do that, we are going to make use of an...
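
Whichever tool ends up sending it, the request that creates a table through the BigQuery REST API (`tables.insert`) has roughly the shape sketched below. This is an assumption-laden illustration: the `build_create_table_request` helper and the project, dataset, and table names are hypothetical, and the sketch only builds the URL and JSON body. It does not authenticate or send anything.

```python
import json

BIGQUERY_API = "https://bigquery.googleapis.com/bigquery/v2"


def build_create_table_request(
    project_id: str, dataset_id: str, table_id: str, schema: dict
) -> tuple[str, dict]:
    """Return the URL and JSON body for a BigQuery tables.insert call."""
    url = f"{BIGQUERY_API}/projects/{project_id}/datasets/{dataset_id}/tables"
    body = {
        "tableReference": {
            "projectId": project_id,
            "datasetId": dataset_id,
            "tableId": table_id,
        },
        # The schema is the TableSchema-shaped dict derived from the contract.
        "schema": schema,
    }
    return url, body


# Hypothetical identifiers and a one-column schema, for illustration only.
example_schema = {
    "fields": [{"name": "id", "type": "STRING", "mode": "REQUIRED"}]
}
url, body = build_create_table_request(
    "my-project", "contracts", "customer", example_schema
)
print("POST", url)
print(json.dumps(body, indent=2))
```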
