Book Image

Mastering SAS Programming for Data Warehousing

By : Monika Wahi
Book Image

Mastering SAS Programming for Data Warehousing

By: Monika Wahi

Overview of this book

SAS is used for various functions in the development and maintenance of data warehouses, thanks to its reputation of being able to handle ’big data’. This book will help you learn the pros and cons of storing data in SAS. As you progress, you’ll understand how to document and design extract-transform-load (ETL) protocols for SAS processes. Later, you’ll focus on how the use of SAS arrays and macros can help standardize ETL. The book will also help you examine approaches for serving up data using SAS and explore how connecting SAS to other systems can enhance the data warehouse user’s experience. By the end of this data management book, you will have a fundamental understanding of the roles SAS can play in a warehouse environment, and be able to choose wisely when designing your data warehousing processes involving SAS.
Table of Contents (18 chapters)
1
Section 1: Managing Data in a SAS Data Warehouse
7
Section 2: Using SAS for Extract-Transform-Load (ETL) Protocols in a Data Warehouse
12
Section 3: Using SAS When Serving Warehouse Data to Users

Questions

  1. How does an analyst tell what the variable names mean and what the coded levels for categorical variables mean in a dataset?

  2. PROC FREQ is for creating frequency tables about categorical variables, and PROC UNIVARIATE is for producing summary statistics about continuous variables. Therefore, why would a person preparing data for loading into a data warehouse ever use PROC FREQ on a continuous variable?

  3. Why is it helpful to plan transformed variables in a data dictionary before developing ETL code?

  4. How does suppressing values as missing in a continuous variable impact PROC UNIVARIATE output?

  5. Imagine you were working on a data warehouse with stock market data. In your data warehouse, you had the value of the stock market at the time of closing every day. What are some classification variables you could make that might improve how the users of the data warehouse were served?

  6. Imagine you had some weather data from a tropical region. After checking a data...