Tableau Prep Cookbook

By : Hendrik Kleine

Tableau Prep Cookbook

By: Hendrik Kleine

Overview of this book

Tableau Prep is a tool in the Tableau software suite, created specifically to develop data pipelines. This book will describe, in detail, a variety of scenarios that you can apply in your environment for developing, publishing, and maintaining complex Extract, Transform and Load (ETL) data pipelines. The book starts by showing you how to set up Tableau Prep Builder. You’ll learn how to obtain data from various data sources, including files, databases, and Tableau Extracts. Next, the book demonstrates how to perform data cleaning and data aggregation in Tableau Prep Builder. You’ll also gain an understanding of Tableau Prep Builder and how you can leverage it to create data pipelines that prepare your data for downstream analytics processes, including reporting and dashboard creation in Tableau. As part of a Tableau Prep flow, you’ll also explore how to use R and Python to implement data science components inside a data pipeline. In the final chapter, you’ll apply the knowledge you’ve gained to build two use cases from scratch, including a data flow for a retail store to prepare a robust dataset using multiple disparate sources and a data flow for a call center to perform ad hoc data analysis. By the end of this book, you’ll be able to create, run, and publish Tableau Prep flows and implement solutions to common problems in data pipelines.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Reviews

Chapter 1: Getting Started with Tableau Prep

Technical requirements

Installing Tableau Prep Builder

Checking out the user interface

Using Tableau Prep for ad hoc data analysis

Preparing data for generic BI tools

Preparing data for Tableau Desktop ad hoc analysis

Free Chapter

Chapter 2: Extract and Load Processes

Technical requirements

Connecting to text and Excel files

Connecting to PDF files

Connecting to SAS, SPSS, and R files

Connecting to on-premises databases

Connecting to cloud databases

Connecting to Tableau extracts

Connecting to JDBC or ODBC data sources

Writing data to CSV and Hyper files

Writing data to databases

Setting up an incremental refresh

Publishing a flow to Tableau Server

Chapter 3: Cleaning Transformations

Technical requirements

Renaming columns

Filtering your dataset

Changing data types

Auto-validating data

Validating data with a custom reference list

Splitting fields with multiple values

Chapter 4: Data Aggregation

Technical requirements

Determining granularity

Aggregating values

Using fixed LOD calculations for grouping data

Grouping data

Chapter 5: Combining Data

Technical requirements

Combining data with Union

Combining data ingest and Union actions

Combining datasets using an inner join

Combining datasets using a left or right join

Expanding datasets using a full outer join

Expanding datasets using a not inner join

Chapter 6: Pivoting Data

Technical requirements

Pivoting columns to rows

Pivoting columns to rows using wildcards

Pivoting rows to columns

Chapter 7: Creating Powerful Calculations

Technical requirements

Creating calculated fields

Creating conditional calculations

Extracting substrings

Changing date formats with calculations

Creating relative temporal calculations

Creating regular expressions in calculations

Chapter 8: Data Science in Tableau Prep Builder

Technical requirements

Preparing Tableau Prep to work with R

Embedding R code in a Tableau Prep flow

Forecasting time series using R

Preparing Tableau Prep to work with Python

Embedding Python code in a Tableau Prep flow

Chapter 9: Creating Prep Flows in Various Business Scenarios

Technical requirements

Creating a flow for transaction analytics

Creating a call center flow for instant analysis

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Expanding datasets using a full outer join

In the Combining data ingest and Union actions recipe, we created an inner join to return rows from two data sources that had a commonality. In the Combining datasets using a left or right join recipe, we created a left join to return all rows from a data source and enrich that data with information from a second source, whenever there was additional information available, without dropping any rows from the original source.

In this recipe, we'll look at a variation of the join, which is named the full outer join. In this case, we'll want to retrieve all rows from both data sources involved in the join, that is, even if there's no overlap. It's essentially doing a left and right join at the same time; you won't lose any data from either data source.

In the example that follows, we'll use a use case where a company is running several projects and each project may have a number of people assigned to it. However...

Tableau Prep Cookbook

By : Hendrik Kleine

Tableau Prep Cookbook

By: Hendrik Kleine

Overview of this book

Related Content you might be interested in

Current Title:

Tableau Prep Cookbook

Data Modeling with Tableau

The Tableau Workshop

Learning Tableau 2022

Expanding datasets using a full outer join