Book Image

Mastering Tableau 2019.1 - Second Edition

By : Marleen Meier, David Baldwin
Book Image

Mastering Tableau 2019.1 - Second Edition

By: Marleen Meier, David Baldwin

Overview of this book

Tableau is one of the leading business intelligence (BI) tools used to solve BI and analytics challenges. With this book, you will master Tableau's features and offerings in various paradigms of the BI domain. This book is also the second edition of the popular Mastering Tableau series, with new features, examples, and updated code. The book covers essential Tableau concepts and its advanced functionalities. Using Tableau Hyper and Tableau Prep, you’ll be able to handle and prepare data easily. You’ll gear up to perform complex joins, spatial joins, union, and data blending tasks using practical examples. Following this, you’ll learn how to perform data densification to make displaying granular data easier. Next, you’ll explore expert-level examples to help you with advanced calculations, mapping, and visual design using various Tableau extensions. With the help of examples, you’ll also learn about improving dashboard performance, connecting Tableau Server, and understanding data visualizations. In the final chapters, you’ll cover advanced use cases such as Self-Service Analytics, Time Series Analytics, and Geo-Spatial Analytics, and learn to connect Tableau to R, Python, and MATLAB. By the end of this book, you’ll have mastered the advanced offerings of Tableau and be able to tackle common and not-so-common challenges faced in the BI domain.
Table of Contents (20 chapters)
Free Chapter
1
Section 1: Tableau Concepts, Basics
9
Section 2: Advanced Calculations, Mapping, Visualizations
16
Section 3: Connecting Tableau to R, Python, and Matlab

Introduction to clustering

Clustering is used to select smaller subsets of data with members sharing similar characteristics from a larger dataset. As an example, consider a marketing scenario. You have a large customer base to which you plan to send advertising material; however, cost prohibits you from sending material to every customer. Performing clustering on the dataset will return groupings of customers with similar characteristics. You can then survey the results and choose a target group.

Major methods for clustering include hierarchical and K-means. Hierarchical clustering is more thorough and thus more time-consuming. It generates a series of models that range from 1, which includes all data points, to n, where each data point is an individual model. K-means clustering is a quicker method in which the user or another function defines the number of clusters. For example...