Book Image

Mastering SAS Programming for Data Warehousing

By : Monika Wahi
Book Image

Mastering SAS Programming for Data Warehousing

By: Monika Wahi

Overview of this book

SAS is used for various functions in the development and maintenance of data warehouses, thanks to its reputation of being able to handle ’big data’. This book will help you learn the pros and cons of storing data in SAS. As you progress, you’ll understand how to document and design extract-transform-load (ETL) protocols for SAS processes. Later, you’ll focus on how the use of SAS arrays and macros can help standardize ETL. The book will also help you examine approaches for serving up data using SAS and explore how connecting SAS to other systems can enhance the data warehouse user’s experience. By the end of this data management book, you will have a fundamental understanding of the roles SAS can play in a warehouse environment, and be able to choose wisely when designing your data warehousing processes involving SAS.
Table of Contents (18 chapters)
1
Section 1: Managing Data in a SAS Data Warehouse
7
Section 2: Using SAS for Extract-Transform-Load (ETL) Protocols in a Data Warehouse
12
Section 3: Using SAS When Serving Warehouse Data to Users

Chapter 2: Reading Big Data into SAS

This chapter will introduce SAS data warehouse developers to the issues and strategies surrounding reading big data into SAS. SAS has native data formats *.SAS7bdat and XPT, but also reads in non-native formats such as *.csv and *.txt. There are advantages and disadvantages to storing data in any of these formats, and special considerations need to be made when preparing transfers of big data in these formats. SAS warehouse developers are tasked with reading data from multiple different source systems into SAS, and this can be done using infile statements, PROC IMPORT, or a strategy that combines both techniques. Because SAS has proficiency in handling big data, SAS data warehouses often need to read in large extracts from legacy systems, many of which provide fixed-width extracts. These can be particularly challenging to read into SAS, and so this chapter also describes approaches to tackling these challenges.

This chapter takes a deep dive...