Connecting the Data: Data Integration Techniques for Building an Operational Data Store (ODS)

By: Angelo Bobak

Overview of this book

When organizations change or enhance their internal structures, business data integration is a complex problem that they must resolve. This book describes the common hurdles you might face while working with data integration and shows you various ways to overcome these challenges. The book begins by explaining the foundational concepts of ODS. Once familiar with schema integration, you'll learn how to reverse engineer each data source for creating a set of data dictionary reports. These reports will provide you with the metadata necessary to apply the schema integration process. As you progress through the chapters, you will learn how to write scripts for populating the source databases and spreadsheets, as well as how to use reports to create Extract, Transform, and Load (ETL) specifications. By the end of the book, you will have the knowledge necessary to design and build a small ODS.
Table of Contents (17 chapters)
Section 1: Site Reliability Engineering – A Prescriptive Way to Implement DevOps
Section 2: Google Cloud Services to Implement DevOps via CI/CD
Appendix: Getting Ready for Professional Cloud DevOps Engineer Certification

Kubernetes objects

A Kubernetes object is a persistent entity that represents a record of intent. An object is typically defined in a YAML configuration file and has two main nested fields: spec and status. The spec describes the desired state of the object, and the status describes its current state as observed and reported by the Kubernetes system. Once the object is created, Kubernetes continuously works to ensure that the current state matches the desired state declared in the configuration.
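To make the idea of a record of intent concrete, here is a minimal Python sketch. It expresses a Deployment manifest as a dictionary (the same structure is normally written in YAML) and uses a toy loop to mimic how a controller moves the observed state toward the desired state. The object name `web`, the image `nginx:1.25`, and the helper functions `reconcile_step` and `converge` are hypothetical and chosen only for illustration; a real controller is far more involved.

```python
import json

# A Deployment manifest as a Python dict. Field names follow the apps/v1
# Deployment schema; the name "web" and the image are hypothetical examples.
deployment = {
    "apiVersion": "apps/v1",
    "kind": "Deployment",
    "metadata": {"name": "web"},
    "spec": {  # desired state, written by the user
        "replicas": 3,
        "selector": {"matchLabels": {"app": "web"}},
        "template": {
            "metadata": {"labels": {"app": "web"}},
            "spec": {"containers": [{"name": "web", "image": "nginx:1.25"}]},
        },
    },
    # No "status" key here: status is reported by the system, not the user.
}


def reconcile_step(desired_replicas: int, observed_replicas: int) -> int:
    """Toy stand-in for a controller: move one step toward the desired count."""
    if observed_replicas < desired_replicas:
        return observed_replicas + 1
    if observed_replicas > desired_replicas:
        return observed_replicas - 1
    return observed_replicas


def converge(desired_replicas: int, observed_replicas: int = 0) -> int:
    """Repeat reconcile steps until the observed count equals the desired count."""
    while observed_replicas != desired_replicas:
        observed_replicas = reconcile_step(desired_replicas, observed_replicas)
    return observed_replicas


if __name__ == "__main__":
    # The manifest serialized as JSON, a format kubectl accepts alongside YAML.
    print(json.dumps(deployment, indent=2))
    print(converge(deployment["spec"]["replicas"]))
```

The manifest contains only the spec because that is the part the user declares. The loop is a simplification of the reconciliation idea, not how Kubernetes itself is implemented.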

Kubernetes supports multiple object types. Each object type is meant for a specific purpose. The following are some critical Kubernetes objects that will be used throughout this chapter. This is not an exhaustive list:

  • Pod – The smallest deployable unit in Kubernetes
  • Deployment – Provides declarative updates for Pods and ReplicaSets
  • StatefulSet – Manages stateful applications and guarantees the ordering and uniqueness of their Pods
  • DaemonSet – Runs a copy of a Pod on every node, or on a selected subset of nodes
  • Job – Creates one or more Pods and will continue...