Book Image

Connecting the Data: Data Integration Techniques for Building an Operational Data Store (ODS)

By : Angelo Bobak
Book Image

Connecting the Data: Data Integration Techniques for Building an Operational Data Store (ODS)

By: Angelo Bobak

Overview of this book

When organizations change or enhance their internal structures, business data integration is a complex problem that they must resolve. This book describes the common hurdles you might face while working with data integration and shows you various ways to overcome these challenges. The book begins by explaining the foundational concepts of ODS. Once familiar with schema integration, you?ll learn how to reverse engineer each data source for creating a set of data dictionary reports. These reports will provide you with the metadata necessary to apply the schema integration process. As you progress through the chapters, you will learn how to write scripts for populating the source databases and spreadsheets, as well as how to use reports to create Extract, Transform, and Load (ETL) specifications. By the end of the book, you will have the knowledge necessary to design and build a small ODS.
Table of Contents (17 chapters)
Free Chapter
1
Section 1: Site Reliability Engineering – A Prescriptive Way to Implement DevOps
6
Section 2: Google Cloud Services to Implement DevOps via CI/CD
Appendix: Getting Ready for Professional Cloud DevOps Engineer Certification

Cloud Deployment Manager

Infrastructure as Code (IaC) is the process of managing and provisioning infrastructure through code instead of manually creating the required resources. Cloud Deployment Manager is a Google Cloud service that provides IaC. Cloud Deployment Manager can create a set of Google Cloud resources and facilitates managing these resources as a unit otherwise called a deployment. For example, it is possible to create a Virtual Private Cloud (VPC) using declarative code through a configuration file rather than manually creating it through the console. The following are some critical properties of Cloud Deployment Manager:

  • Can create multiple resources in parallel, such as multiple VMs
  • Can provide input variables to create a resource with specific user-defined values as required
  • Can get the return value of a newly created resource, such as the instance ID of a newly created Google Compute Engine instance
  • Can create dependencies where one resource definition...