Chapter 5: Data Processing and Transformations | Data Engineering with Alteryx

Book Overview & Buying
Table Of Contents

Data Engineering with Alteryx

By : Paul Houghton

4.8 (11)

Buy this Book

Data Engineering with Alteryx

4.8 (11)

By: Paul Houghton

Buy this Book

Overview of this book

Alteryx is a GUI-based development platform for data analytic applications. Data Engineering with Alteryx will help you leverage Alteryx’s code-free aspects which increase development speed while still enabling you to make the most of the code-based skills you have. This book will teach you the principles of DataOps and how they can be used with the Alteryx software stack. You’ll build data pipelines with Alteryx Designer and incorporate the error handling and data validation needed for reliable datasets. Next, you’ll take the data pipeline from raw data, transform it into a robust dataset, and publish it to Alteryx Server following a continuous integration process. By the end of this Alteryx book, you’ll be able to build systems for validating datasets, monitoring workflow performance, managing access, and promoting the use of your data sources.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example workflow files

Download the color images

Conventions used

Get in touch

Share Your Thoughts

Part 1: Introduction

Free Chapter

Chapter 1: Getting Started with Alteryx

Understanding the Alteryx platform

Using Alteryx Designer

Leveraging Alteryx Server and Alteryx Connect

Using this book in your data engineering work

Summary

Chapter 2: Data Engineering with Alteryx

What is a data engineer?

Using Alteryx products as a data engineer

Applying DataOps as an Alteryx data engineer

Summary

Chapter 3: DataOps and Its Benefits

The benefits the DataOps framework brings to your organization

Understanding DataOps principles

Applying DataOps to Alteryx

Using Alteryx software with DataOps

General steps for deploying DataOps in your environment

Summary

Part 2: Functional Steps in DataOps

Chapter 4: Sourcing the Data

Technical requirements

Accessing internal data sources

Integrating public data sources with Download tool use

Leveraging external data sources from authenticated APIs

Initial cleansing of datasets

Constructing a data pipeline in Alteryx Designer

Summary

Chapter 5: Data Processing and Transformations

Technical requirements

The data cleansing process

Profiling data with summary and statistical aggregations

Transforming our data pipeline

Summary

Chapter 6: Destination Management

Technical requirements

Writing to destinations

Managing database connections

Accessing more data sources with custom connections

Integrating data pipelines across environments

Publishing the external data to a Snowflake destination

Summary

Chapter 7: Extracting Value

Technical requirements

Exploratory data analysis in Alteryx and surfacing the datasets for BI tools

Using Alteryx to deliver standard reports

Summary

Chapter 8: Beginning Advanced Analytics

Technical requirements

Implementing spatial analytics with Alteryx

Beginning the ML process in Alteryx

Summary

Part 3: Governance of DataOps

Chapter 9: Testing Workflows and Outputs

Technical requirements

Workflow tests and messages

Validating data outputs

Centralizing the monitoring outputs with Insights

Summary

Chapter 10: Monitoring DataOps and Managing Changes

Technical requirements

Using the Alteryx Server monitoring workflow

Creating an insight dashboard for workflow monitoring

Exporting the MongoDB database for custom analysis

Using Git and GitHub Actions for continuous integration

Summary

Chapter 11: Securing and Managing Access

Technical requirements

Organizing content on Alteryx Server

Managing collections

Securing the data environment

Summary

Chapter 12: Making Data Easy to Use and Discoverable with Alteryx

Technical requirements

What is Alteryx Connect, and how does it help DataOps?

Publishing the data lineage to Alteryx Connect

Data nexus

Syncing the Connect data dictionary with other data catalogs

Summary

Chapter 13: Conclusion

The Alteryx data engineer

The functional steps in DataOps

Governance of DataOps with Alteryx

Our Alteryx data pipeline

Final summary

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Data Engineering with Alteryx

By : Paul Houghton

Data Engineering with Alteryx

By: Paul Houghton

Overview of this book

The data cleansing process

Selecting columns

Confirmation

Buy this book with your credits?

Submit Your Feedback

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access