Book Image

Data Engineering with Alteryx

By : Paul Houghton
Book Image

Data Engineering with Alteryx

By: Paul Houghton

Overview of this book

Alteryx is a GUI-based development platform for data analytic applications. Data Engineering with Alteryx will help you leverage Alteryx’s code-free aspects which increase development speed while still enabling you to make the most of the code-based skills you have. This book will teach you the principles of DataOps and how they can be used with the Alteryx software stack. You’ll build data pipelines with Alteryx Designer and incorporate the error handling and data validation needed for reliable datasets. Next, you’ll take the data pipeline from raw data, transform it into a robust dataset, and publish it to Alteryx Server following a continuous integration process. By the end of this Alteryx book, you’ll be able to build systems for validating datasets, monitoring workflow performance, managing access, and promoting the use of your data sources.
Table of Contents (18 chapters)
1
Part 1: Introduction
5
Part 2: Functional Steps in DataOps
11
Part 3: Governance of DataOps

Summary

In this chapter, we have looked at connecting to different data sources. First, we investigated connecting to local and network files and learned the principles for connecting to databases with ODBC connections. We also investigated enriching our internal data sources with public and authenticated APIs. Finally, we saw how to use the Download tool to access the data resources and download them to Alteryx.

Once we had the data source, we performed some initial data cleansing so that our dataset could be used quickly. We can deliver on the Improve Cycle Times principle by getting the initial dataset to our customers so that they can validate what we have acquired.

The next chapter will investigate how we can iterate over the dataset to improve it for better use and transform it into an excellent dataset for our requirements.