Book Image

Data Engineering with Alteryx

By : Paul Houghton
Book Image

Data Engineering with Alteryx

By: Paul Houghton

Overview of this book

Alteryx is a GUI-based development platform for data analytic applications. Data Engineering with Alteryx will help you leverage Alteryx’s code-free aspects which increase development speed while still enabling you to make the most of the code-based skills you have. This book will teach you the principles of DataOps and how they can be used with the Alteryx software stack. You’ll build data pipelines with Alteryx Designer and incorporate the error handling and data validation needed for reliable datasets. Next, you’ll take the data pipeline from raw data, transform it into a robust dataset, and publish it to Alteryx Server following a continuous integration process. By the end of this Alteryx book, you’ll be able to build systems for validating datasets, monitoring workflow performance, managing access, and promoting the use of your data sources.
Table of Contents (18 chapters)
1
Part 1: Introduction
5
Part 2: Functional Steps in DataOps
11
Part 3: Governance of DataOps

Publishing the data lineage to Alteryx Connect

We have been looking at a data catalog where all the individual assets, such as the tables or fields, have the information already populated. For this information to be available, you need to populate the data asset information into Connect. There are three methods for populating the Connect data assets:

  • Loading metadata directly from Connect
  • Using the prebuilt workflow apps
  • Creating a custom-built data source with the Connect APIs

These methods allow you to populate your Connect data dictionary with the information that exists in your data asset. For databases, Alteryx will extract any comment information in addition to tables, field names, and field types. Let's take a look at them one by one.

Loading metadata directly from Connect

The first method for loading metadata is directly from Alteryx Connect. To initialize the metadata collection from Connect, there are two prerequisites:

  • You must be...