Book Image

Mastering SAS Programming for Data Warehousing

By : Monika Wahi
Book Image

Mastering SAS Programming for Data Warehousing

By: Monika Wahi

Overview of this book

SAS is used for various functions in the development and maintenance of data warehouses, thanks to its reputation of being able to handle ’big data’. This book will help you learn the pros and cons of storing data in SAS. As you progress, you’ll understand how to document and design extract-transform-load (ETL) protocols for SAS processes. Later, you’ll focus on how the use of SAS arrays and macros can help standardize ETL. The book will also help you examine approaches for serving up data using SAS and explore how connecting SAS to other systems can enhance the data warehouse user’s experience. By the end of this data management book, you will have a fundamental understanding of the roles SAS can play in a warehouse environment, and be able to choose wisely when designing your data warehousing processes involving SAS.
Table of Contents (18 chapters)
1
Section 1: Managing Data in a SAS Data Warehouse
7
Section 2: Using SAS for Extract-Transform-Load (ETL) Protocols in a Data Warehouse
12
Section 3: Using SAS When Serving Warehouse Data to Users

Loading transformed data

As described in Chapter 4, Managing ETL in SAS, in a SAS data warehouse, it's not uncommon to receive monthly or annual files that require regular ETL. Imagine receiving the BRFSS files from 2016, 2017, and 2018, and needing to process them. If the datasets are all named according to a particular naming convention, we can use macro code to automatically load the data files and put them through an ETL protocol.

The easiest way to demonstrate this with a simple exercise is to have us generate the multiple files we will later read in. That way, we can concentrate on writing the load macro, and not whether the data will cooperate. For this exercise, we will use the dataset Chap8_2. This dataset has only two variables: _STATE and FMONTH. As with the previous file, the only states included in _STATE are codes 12, 25, and 27, for Florida, Massachusetts, and Minnesota, respectively. FMONTH contains the month the survey was conducted coded as a number, 1 through...