Book Image

Data Engineering with Alteryx

By : Paul Houghton
Book Image

Data Engineering with Alteryx

By: Paul Houghton

Overview of this book

Alteryx is a GUI-based development platform for data analytic applications. Data Engineering with Alteryx will help you leverage Alteryx’s code-free aspects which increase development speed while still enabling you to make the most of the code-based skills you have. This book will teach you the principles of DataOps and how they can be used with the Alteryx software stack. You’ll build data pipelines with Alteryx Designer and incorporate the error handling and data validation needed for reliable datasets. Next, you’ll take the data pipeline from raw data, transform it into a robust dataset, and publish it to Alteryx Server following a continuous integration process. By the end of this Alteryx book, you’ll be able to build systems for validating datasets, monitoring workflow performance, managing access, and promoting the use of your data sources.
Table of Contents (18 chapters)
Part 1: Introduction
Part 2: Functional Steps in DataOps
Part 3: Governance of DataOps


In this chapter, we have investigated several methods for saving our datasets for future use. We have seen the common methods for managing how to save files to both local destinations and network drives. These methods also included a process for ensuring that the workflow will not error when it is either shared with other people or saved to an Alteryx server.

We also investigated methods for managing database connections and how those connections can be used across different environments. We saw how the different methods of variable management and DNS connections could be used so that different databases can be targeted in different environments. This ability is especially useful when managing separate development and production environments.

Finally, we implemented these principles to save our places dataset to a Snowflake database table. We implemented the DSN connection so that when publishing to our server, a separate destination can be used in production.