Data Engineering with Alteryx

By: Paul Houghton

Overview of this book

Alteryx is a GUI-based development platform for data analytics applications. Data Engineering with Alteryx will help you leverage Alteryx’s code-free aspects, which increase development speed, while still enabling you to make the most of the code-based skills you have. This book will teach you the principles of DataOps and how they can be used with the Alteryx software stack. You’ll build data pipelines with Alteryx Designer and incorporate the error handling and data validation needed for reliable datasets. Next, you’ll take the data pipeline from raw data, transform it into a robust dataset, and publish it to Alteryx Server following a continuous integration process. By the end of this Alteryx book, you’ll be able to build systems for validating datasets, monitoring workflow performance, managing access, and promoting the use of your data sources.
Table of Contents (18 chapters)
Part 1: Introduction
Part 2: Functional Steps in DataOps
Part 3: Governance of DataOps

Leveraging external data sources from authenticated APIs

While public datasets give you a basis for enriching a dataset, they are limited to publicly available information. Many more datasets are available, some by subscription and some free of charge, but they require some form of authentication.

The specific authentication method will change from one API to another, but you will find some common themes across the different data providers:

  • API URL parameter
  • API call headers
  • OAuth authorization framework

The OAuth authorization framework is the most complex of the three methods because each OAuth provider can customize the process for its application. The URL parameter and call header methods follow a more consistent approach across different API providers.
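To make the two simpler methods concrete, here is a minimal Python sketch of attaching an API key either as a URL parameter or as a call header. It is an illustration only: the host `api.example.com`, the parameter name `api_key`, and the header name `X-API-Key` are assumptions, and each provider documents its own names. The sketch only builds the URL and the headers; it does not send a request.

```python
from typing import Dict, Optional
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def add_key_as_url_parameter(url: str, api_key: str,
                             param_name: str = "api_key") -> str:
    """Return the URL with the API key appended to its query string.

    The parameter name "api_key" is a placeholder; providers choose their own.
    """
    parts = urlsplit(url)
    # Keep any query parameters that are already on the URL.
    query = parse_qsl(parts.query, keep_blank_values=True)
    query.append((param_name, api_key))
    # urlencode percent-encodes the key so special characters stay valid.
    return urlunsplit(parts._replace(query=urlencode(query)))


def add_key_as_header(api_key: str,
                      header_name: str = "X-API-Key",
                      headers: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Return a copy of the headers with the API key added.

    The header name "X-API-Key" is a placeholder; some providers expect
    a different header, such as "Authorization".
    """
    merged = dict(headers or {})
    merged[header_name] = api_key
    return merged
```

Calling `add_key_as_url_parameter("https://api.example.com/v1/data?limit=10", "secret")` yields a URL that carries the key in its query string, while `add_key_as_header("secret")` yields a headers dictionary to pass along with the request. A URL parameter is visible wherever the URL is logged, so when a provider supports both methods, the header is usually the safer choice.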

Because the OAuth authorization is unique to individual providers, describing a general process to follow is difficult without tying the process to a specific...