Hands-On SAS for Data Analysis

By : Harish Gulati

Hands-On SAS for Data Analysis

By: Harish Gulati

Overview of this book

SAS is one of the leading enterprise tools in the world today when it comes to data management and analysis. It enables the fast and easy processing of data and helps you gain valuable business insights for effective decision-making. This book will serve as a comprehensive guide that will prepare you for the SAS certification exam. After a quick overview of the SAS architecture and components, the book will take you through the different approaches to importing and reading data from different sources using SAS. You will then cover SAS Base and 4GL, understanding data management and analysis, along with exploring SAS functions for data manipulation and transformation. Next, you'll discover SQL procedures and get up to speed on creating and validating queries. In the concluding chapters, you'll learn all about data visualization, right from creating bar charts and sample geographic maps through to assigning patterns and formats. In addition to this, the book will focus on macro programming and its advanced aspects. By the end of this book, you will be well versed in SAS programming and have the skills you need to easily handle and manage your data-related problems in SAS.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

Section 1: SAS Basics

Introduction to SAS Programming

SAS dataset fundamentals

SAS programming language – basic syntax

SAS LOG

Formats

Summary

Data Manipulation and Transformation

Length of a variable

Case conversion and alignment

String identification

Dealing with blanks

Missing and multiple values

Interval calculations

Concatenation

Logic and control

Number manipulation

Summary

Section 2: Merging, Optimizing, and Descriptive Statistics

Combining, Indexing, Encryption, and Compression Techniques Simplified

Introduction to combining

Merging

Summary

Power of Statistics, Reporting, Transforming Procedures, and Functions

Proc Freq

Proc Univariate

Proc Means and Summary

Proc Corr

Proc REG

Proc Transpose

Summary

Section 3: Advanced Programming

Advanced Programming Techniques - SAS Macros

What are macros?

Macro variable processing

Macro resolution tracking

Macro definition processing

Comparing positional and keywords parameters

Data-driven programming

Leveraging automatic global macro variables

Macros that evaluate

Writing efficient macros

Summary

Powerful Functions, Options, and Automatic Variables Simplified

NOMPREPLACE and MREPLACE

NOMCOMPILE and NCOMPILE

MCOMPILENOTE

NOMEXECNOTE and MEXECNOTE

MAUTOCOMPLOC

MACRO and NOMACRO

Exchanging values between the DATA step and macro variables

CALL EXECUTE

Altering the CALL SYMPUT example

Resolving macro variables

Macro quoting

Summary

Section 4: SQL in SAS

Advanced Programming Techniques Using PROC SQL

Comparing data steps and Proc SQL

Proc SQL joins

Proc SQL essentials

Dictionary tables

Summary

Deep Dive into PROC SQL

SAS views in Proc SQL

Making changes with Proc SQL

Identifying duplicates using Proc SQL

Creating an index in Proc SQL

Macros and Proc SQL

Summary

Section 5: Data Visualization and Reporting

Data Visualization

The role of data visualization in analytics

Histograms

Line plots

Vertical and horizontal bar charts

Scatter charts

Box plot

Summary

Reporting and Output Delivery System

Proc Tabulate

Specifying the ODS destination

Formatting ODS files

ODS Excel charts

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Indexing

We broached the subject of creating an index file in previous chapters. In this section, we will describe how the indexes are stored and retrieved. The index file consists of entries that are organized hierarchically and connected by pointers, all of which are maintained by SAS. The lowest level in the index file hierarchy consists of entries that represent each distinct value for an indexed variable, in ascending value order. Each entry contains the following information:

A distinct value
One or more unique record identifiers (referred to as a RID) that identify each observation that contains the value

If we created an index for the City variable using the AC dataset in the mismatch dataset, we would have an index file with entries such as the following:

Value	RID
Adelaide	1
Copenhagen	2
Hong Kong	3, 4, 5, 6
hong Kong	7, 8, 9

Let...

Hands-On SAS for Data Analysis

By : Harish Gulati

Hands-On SAS for Data Analysis

By: Harish Gulati

Overview of this book

Related Content you might be interested in

Current Title:

Hands-On SAS for Data Analysis

Big Data Analytics with SAS

Mastering SAS Programming for Data Warehousing

SAS for Finance