Connecting the Data: Data Integration Techniques for Building an Operational Data Store (ODS)

By: Angelo Bobak

Overview of this book

When organizations change or enhance their internal structures, business data integration is a complex problem that they must resolve. This book describes the common hurdles you might face while working with data integration and shows you various ways to overcome them. The book begins by explaining the foundational concepts of ODS. Once familiar with schema integration, you'll learn how to reverse engineer each data source to create a set of data dictionary reports. These reports provide the metadata you need to apply the schema integration process. As you progress through the chapters, you will learn how to write scripts for populating the source databases and spreadsheets, as well as how to use reports to create Extract, Transform, and Load (ETL) specifications. By the end of the book, you will have the knowledge necessary to design and build a small ODS.
Table of Contents (17 chapters)
Section 1: Site Reliability Engineering – A Prescriptive Way to Implement DevOps
Section 2: Google Cloud Services to Implement DevOps via CI/CD
Appendix: Getting Ready for Professional Cloud DevOps Engineer Certification

Points to remember

The following are some important points to remember:

  • GKE is fully managed, uses Container-Optimized OS, and supports autoscaling, auto-repair of nodes, and auto-upgrades.
  • GKE supports two modes of operation: Standard and Autopilot.
  • GKE Standard mode supports VPC-native traffic routing and HTTP load balancing as default options.
  • Cloud Operations for GKE is enabled by default.
  • A private Kubernetes Engine cluster cannot be accessed from the public internet.
  • A node pool represents a group of nodes with the same configuration.
  • By default, a new node pool runs the latest Kubernetes version and can be configured for auto-upgrade or can be manually upgraded.
  • Node pools in a regional or multi-zonal cluster are replicated to multiple zones.
  • A multi-zonal cluster has only a single replica of the control plane.
  • A regional cluster has multiple replicas of the control plane running across multiple zones in a region.
  • ...
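
The difference between multi-zonal and regional clusters noted above can be illustrated with a small model. This is only a sketch, not GKE's API: the `Cluster` class, the `Location` enum, and the zone names are hypothetical, and placing one control plane replica in each listed zone of a regional cluster is an assumption made for illustration (the points above say only that there are multiple replicas across multiple zones).

```python
from dataclasses import dataclass
from enum import Enum
from typing import Tuple


class Location(Enum):
    ZONAL = "zonal"
    MULTI_ZONAL = "multi-zonal"
    REGIONAL = "regional"


@dataclass(frozen=True)
class Cluster:
    """Hypothetical model of a cluster; not part of any Google Cloud library."""
    name: str
    location: Location
    zones: Tuple[str, ...]  # illustrative zone names, first one is the primary

    def control_plane_zones(self) -> Tuple[str, ...]:
        # Only a regional cluster replicates its control plane across zones.
        # Assumption for this sketch: one replica per listed zone.
        if self.location is Location.REGIONAL:
            return self.zones
        # Zonal and multi-zonal clusters have a single control plane replica.
        return self.zones[:1]

    def node_pool_zones(self) -> Tuple[str, ...]:
        # Node pools in regional or multi-zonal clusters are replicated
        # to every zone of the cluster.
        if self.location is Location.ZONAL:
            return self.zones[:1]
        return self.zones


zones = ("zone-a", "zone-b", "zone-c")
multi = Cluster("demo-multi", Location.MULTI_ZONAL, zones)
regional = Cluster("demo-regional", Location.REGIONAL, zones)

# Both cluster types spread their node pools over all three zones,
# but only the regional cluster has more than one control plane replica.
print(len(multi.control_plane_zones()), len(multi.node_pool_zones()))
print(len(regional.control_plane_zones()), len(regional.node_pool_zones()))
```

In this model the two cluster types differ only in control plane placement, which is the distinction the last two points draw: node availability is the same, while control plane availability is not.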