Book Image

Mastering SAS Programming for Data Warehousing

By : Monika Wahi
Book Image

Mastering SAS Programming for Data Warehousing

By: Monika Wahi

Overview of this book

SAS is used for various functions in the development and maintenance of data warehouses, thanks to its reputation of being able to handle ’big data’. This book will help you learn the pros and cons of storing data in SAS. As you progress, you’ll understand how to document and design extract-transform-load (ETL) protocols for SAS processes. Later, you’ll focus on how the use of SAS arrays and macros can help standardize ETL. The book will also help you examine approaches for serving up data using SAS and explore how connecting SAS to other systems can enhance the data warehouse user’s experience. By the end of this data management book, you will have a fundamental understanding of the roles SAS can play in a warehouse environment, and be able to choose wisely when designing your data warehousing processes involving SAS.
Table of Contents (18 chapters)
1
Section 1: Managing Data in a SAS Data Warehouse
7
Section 2: Using SAS for Extract-Transform-Load (ETL) Protocols in a Data Warehouse
12
Section 3: Using SAS When Serving Warehouse Data to Users

Chapter 6

  1. It is good to include _list at the end of an array name so that it is clear by its name that it is an array, and not a variable.

  2. Adding a condition to an array allows it to process certain parts of the input array one way, and other parts of the input array another way, depending on the criteria that's been set for the input array.

  3. The reason why it is often necessary to rename variables for arrays is that array processing goes faster if the input variables are named according to a naming convention, where each one has the same prefix and is followed by an incrementing number, such as CM1 through CM11 for the co-morbidity variables, as demonstrated in this chapter.

  4. SAS data warehouse managers need to make the following considerations about serving up index variables to users: whether the index variable was passed on from the native data provider or calculated at the SAS warehouse, how the index variable is documented, what index variables should be made...