Book Image

Mastering SAS Programming for Data Warehousing

By : Monika Wahi
Book Image

Mastering SAS Programming for Data Warehousing

By: Monika Wahi

Overview of this book

SAS is used for various functions in the development and maintenance of data warehouses, thanks to its reputation of being able to handle ’big data’. This book will help you learn the pros and cons of storing data in SAS. As you progress, you’ll understand how to document and design extract-transform-load (ETL) protocols for SAS processes. Later, you’ll focus on how the use of SAS arrays and macros can help standardize ETL. The book will also help you examine approaches for serving up data using SAS and explore how connecting SAS to other systems can enhance the data warehouse user’s experience. By the end of this data management book, you will have a fundamental understanding of the roles SAS can play in a warehouse environment, and be able to choose wisely when designing your data warehousing processes involving SAS.
Table of Contents (18 chapters)
1
Section 1: Managing Data in a SAS Data Warehouse
7
Section 2: Using SAS for Extract-Transform-Load (ETL) Protocols in a Data Warehouse
12
Section 3: Using SAS When Serving Warehouse Data to Users

Chapter 4: Managing ETL in SAS

This chapter covers issues and topics that need to be considered and addressed in the management of extract, transform, and load (ETL) functions in a SAS data warehouse. First, we discuss the different storage environments needed within the SAS data warehouse, and how staff user groups must be designated to correspond to these different storage environments so that access to them is internally controlled. Second, managing the storage of documentation for source datasets and recommendations for dataset naming conventions are covered.

Next, we describe SAS arrays, and a demonstration explaining how they are used in data steps (when performing transformation) is provided. The use of arrays can impact variable naming conventions in SAS, which are discussed with regard to using arrays in transformation code, as well as with respect to maintaining data in an evolving warehouse environment. Modular code is described and a set of code-naming conventions is...