Book Image

Data Engineering with Alteryx

By : Paul Houghton
Book Image

Data Engineering with Alteryx

By: Paul Houghton

Overview of this book

Alteryx is a GUI-based development platform for data analytic applications. Data Engineering with Alteryx will help you leverage Alteryx’s code-free aspects which increase development speed while still enabling you to make the most of the code-based skills you have. This book will teach you the principles of DataOps and how they can be used with the Alteryx software stack. You’ll build data pipelines with Alteryx Designer and incorporate the error handling and data validation needed for reliable datasets. Next, you’ll take the data pipeline from raw data, transform it into a robust dataset, and publish it to Alteryx Server following a continuous integration process. By the end of this Alteryx book, you’ll be able to build systems for validating datasets, monitoring workflow performance, managing access, and promoting the use of your data sources.
Table of Contents (18 chapters)
1
Part 1: Introduction
5
Part 2: Functional Steps in DataOps
11
Part 3: Governance of DataOps

The Alteryx data engineer

The Alteryx Software stack, comprising Alteryx Designer, Alteryx Server, and Alteryx Connect, is the core tool an Alteryx data engineer uses to build, deploy, and publicize the datasets and data pipelines created. Following an iterative development process, a data engineer quickly finds a dataset, transforms the dataset, and then provides the dataset to end users.

In Chapter 1, Getting Started, we introduced the components of the Alteryx platform and demonstrated their use in a simple example. We also introduced the working definition of data engineering as follows:

Data engineering is the process of taking data from any number of disparate sources and transforming it into a usable format for an end user.

This definition provided a starting point for building a process for accessing data sources and making them a valuable resource for our end users. Using Alteryx provided three key benefits for you as a data engineer:

  • Speed of development
  • ...