Book Image

Data Modeling with Snowflake

By : Serge Gershkovich

5 (2)

Book Image

Data Modeling with Snowflake

5 (2)

By: Serge Gershkovich

Overview of this book

The Snowflake Data Cloud is one of the fastest-growing platforms for data warehousing and application workloads. Snowflake's scalable, cloud-native architecture and expansive set of features and objects enables you to deliver data solutions quicker than ever before. Yet, we must ensure that these solutions are developed using recommended design patterns and accompanied by documentation that’s easily accessible to everyone in the organization. This book will help you get familiar with simple and practical data modeling frameworks that accelerate agile design and evolve with the project from concept to code. These universal principles have helped guide database design for decades, and this book pairs them with unique Snowflake-native objects and examples like never before – giving you a two-for-one crash course in theory as well as direct application. By the end of this Snowflake book, you’ll have learned how to leverage Snowflake’s innovative features, such as time travel, zero-copy cloning, and change-data-capture, to create cost-effective, efficient designs through time-tested modeling principles that are easily digestible when coupled with real-world examples.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Share Your Thoughts

Download a free PDF copy of this book

Part 1: Core Concepts in Data Modeling and Snowflake Architecture

Part 1: Core Concepts in Data Modeling and Snowflake Architecture

Free Chapter

Chapter 1: Unlocking the Power of Modeling

Chapter 1: Unlocking the Power of Modeling

Technical requirements

Modeling with purpose

Leveraging the modeling toolkit

The benefits of database modeling

Operational and analytical modeling scenarios

A look at relational and transformational modeling

Further reading

Chapter 2: An Introduction to the Four Modeling Types

Chapter 2: An Introduction to the Four Modeling Types

Design and process

Ubiquitous modeling

Physical modeling

Transformational

Further reading

Chapter 3: Mastering Snowflake’s Architecture

Chapter 3: Mastering Snowflake’s Architecture

Traditional architectures

Snowflake’s solution

Snowflake’s three-tier architecture

Snowflake’s features

Costs to consider

Saving cash by using cache

Further reading

Chapter 4: Mastering Snowflake Objects

Chapter 4: Mastering Snowflake Objects

Snowflake views

Materialized views

Change tracking

Chapter 5: Speaking Modeling through Snowflake Objects

Chapter 5: Speaking Modeling through Snowflake Objects

Entities as tables

Attributes as columns

Constraints and enforcement

Identifiers as primary keys

Alternate keys as unique constraints

Relationships as foreign keys

Mandatory columns as NOT NULL constraints

Chapter 6: Seeing Snowflake’s Architecture through Modeling Notation

Chapter 6: Seeing Snowflake’s Architecture through Modeling Notation

A history of relational modeling

RM versus entity-relationship diagram

Visual modeling conventions

The benefit of synchronized modeling

Part 2: Applied Modeling from Idea to Deployment

Part 2: Applied Modeling from Idea to Deployment

Chapter 7: Putting Conceptual Modeling into Practice

Chapter 7: Putting Conceptual Modeling into Practice

Embarking on conceptual design

Modeling in reverse

Further reading

Chapter 8: Putting Logical Modeling into Practice

Chapter 8: Putting Logical Modeling into Practice

Expanding from conceptual to logical modeling

Adding attributes

Cementing the relationships

Chapter 9: Database Normalization

Chapter 9: Database Normalization

An overview of database normalization

Database normalization through examples

Data models on a spectrum of normalization

Chapter 10: Database Naming and Structure

Chapter 10: Database Naming and Structure

Naming conventions

Organizing a Snowflake database

Chapter 11: Putting Physical Modeling into Practice

Chapter 11: Putting Physical Modeling into Practice

Technical requirements

Considerations before starting the implementation

Expanding from logical to physical modeling

Deploying a physical model

Creating an ERD from a physical model

Part 3: Solving Real-World Problems with Transformational Modeling

Part 3: Solving Real-World Problems with Transformational Modeling

Chapter 12: Putting Transformational Modeling into Practice

Chapter 12: Putting Transformational Modeling into Practice

Technical requirements

Separating the model from the object

Shaping transformations through relationships

Join elimination using constraints

Joins and set operators

Performance considerations and monitoring

Putting transformational modeling into practice

Chapter 13: Modeling Slowly Changing Dimensions

Chapter 13: Modeling Slowly Changing Dimensions

Technical requirements

Dimensions overview

Recipes for maintaining SCDs in Snowflake

Chapter 14: Modeling Facts for Rapid Analysis

Chapter 14: Modeling Facts for Rapid Analysis

Technical requirements

Fact table types

Fact table measures

Getting the facts straight

Maintaining fact tables using Snowflake features

Chapter 15: Modeling Semi-Structured Data

Chapter 15: Modeling Semi-Structured Data

Technical requirements

The benefits of semi-structured data in Snowflake

Getting hands-on with semi-structured data

Schema-on-read != schema-no-need

Converting semi-structured data into relational data

Chapter 16: Modeling Hierarchies

Chapter 16: Modeling Hierarchies

Technical requirements

Understanding and distinguishing between hierarchies

Maintaining hierarchies in Snowflake

Chapter 17: Scaling Data Models through Modern Techniques

Chapter 17: Scaling Data Models through Modern Techniques

Technical requirements

Demystifying Data Vault 2.0

Modeling the data marts

Discovering Data Mesh

Index

Other Books You May Enjoy

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Appendix

Technical requirements

The exceptional time traveler

The secret column type Snowflake refuses to document

Read the functional manual (RTFM)

Customer Reviews

5 (2)

5 star

100%

4 star

0

3 star

0

2 star

0

1 star

0

Getting hands-on with semi-structured data

Although we will query semi-structured JSON data as part of this exercise, its storage still conforms to modeling best practices such as naming and standard columns. In this example, we will use semi-structured data containing information about pirates – such as details about the crew, weapons, and their ship – all stored in a single VARIANT data type. With relational data, a row represents a single entity; in semi-structured data, a row is an entire file (although the file itself can contain single or countless entities). For this reason, metadata columns to mark individual loads and source filenames are stored alongside VARIANT.

Figure 15.1 – A table with ELT meta columns and VARIANT for storing semi-structured data

Figure 15.1 – A table with ELT meta columns and VARIANT for storing semi-structured data

This example uses AUTOINCREMENT (a.k.a. IDENTITY) as the default to generate a sequential unique ID for each load/record inserted.

In a real-world scenario, semi-structured...