Book Image

Mastering SAS Programming for Data Warehousing

By : Monika Wahi
Book Image

Mastering SAS Programming for Data Warehousing

By: Monika Wahi

Overview of this book

SAS is used for various functions in the development and maintenance of data warehouses, thanks to its reputation of being able to handle ’big data’. This book will help you learn the pros and cons of storing data in SAS. As you progress, you’ll understand how to document and design extract-transform-load (ETL) protocols for SAS processes. Later, you’ll focus on how the use of SAS arrays and macros can help standardize ETL. The book will also help you examine approaches for serving up data using SAS and explore how connecting SAS to other systems can enhance the data warehouse user’s experience. By the end of this data management book, you will have a fundamental understanding of the roles SAS can play in a warehouse environment, and be able to choose wisely when designing your data warehousing processes involving SAS.
Table of Contents (18 chapters)
1
Section 1: Managing Data in a SAS Data Warehouse
7
Section 2: Using SAS for Extract-Transform-Load (ETL) Protocols in a Data Warehouse
12
Section 3: Using SAS When Serving Warehouse Data to Users

Chapter 5

  1. PROC TABULATE is for developing well-formatted tabular reports.

  2. The ODS allows the programmer to tell SAS which internal tables generated during the PROC should be saved outside the PROC, and where to save them.

  3. No special code is needed. For the PROCs that use GTL – PROC SGPLOT, PROC SGPANEL, PROC SGSCATTER, and PROC TEMPLATE used with PROC SGRENDER – the ODS is automatically deployed when running the PROC.

  4. Since the department leader wants to write paragraphs next to the tables, the best format in which to produce the SAS report would be *.rtf, which opens in Microsoft Word. PROC TABULATE could be used to format the tables as per the department leader's specifications.

  5. To output a list of variable names using the ODS, first, the programmer would need to recall what PROC would produce in its output. PROC CONTENTS is the commonly used PROC for outputting a list of variables in a dataset. Once the programmer identifies the appropriate...