Book Image

Data Engineering with Alteryx

By : Paul Houghton
Book Image

Data Engineering with Alteryx

By: Paul Houghton

Overview of this book

Alteryx is a GUI-based development platform for data analytic applications. Data Engineering with Alteryx will help you leverage Alteryx’s code-free aspects which increase development speed while still enabling you to make the most of the code-based skills you have. This book will teach you the principles of DataOps and how they can be used with the Alteryx software stack. You’ll build data pipelines with Alteryx Designer and incorporate the error handling and data validation needed for reliable datasets. Next, you’ll take the data pipeline from raw data, transform it into a robust dataset, and publish it to Alteryx Server following a continuous integration process. By the end of this Alteryx book, you’ll be able to build systems for validating datasets, monitoring workflow performance, managing access, and promoting the use of your data sources.
Table of Contents (18 chapters)
1
Part 1: Introduction
5
Part 2: Functional Steps in DataOps
11
Part 3: Governance of DataOps

Chapter 12: Making Data Easy to Use and Discoverable with Alteryx

Building a data pipeline and a dataset provides the foundation for data operations. Getting to the dataset solves the initial request that a data engineer will receive, but it will always leave end users with questions about the dataset you have created. Questions such as What does the field xyz represent? Or, How was this metric calculated? You can also have a situation where, because the dataset is unknown to most users in your organization, you get duplicate requests for the same data, or the dataset gets recreated, often with slight differences. This divergence in datasets leads to different teams reporting different results for the same question. Alteryx Connect was created to solve the data duplication and discovery challenge.

In this chapter, we will cover the following topics:

  • What is Alteryx Connect, and how does it help DataOps?
  • Publishing the data lineage to Alteryx Connect
  • Syncing the...