Hands-On SAS for Data Analysis

By: Harish Gulati
Overview of this book

SAS is one of the leading enterprise tools in the world today when it comes to data management and analysis. It enables the fast and easy processing of data and helps you gain valuable business insights for effective decision-making. This book will serve as a comprehensive guide that will prepare you for the SAS certification exam. After a quick overview of the SAS architecture and components, the book will take you through the different approaches to importing and reading data from different sources using SAS. You will then cover SAS Base and 4GL, understanding data management and analysis, along with exploring SAS functions for data manipulation and transformation. Next, you'll discover SQL procedures and get up to speed on creating and validating queries. In the concluding chapters, you'll learn all about data visualization, right from creating bar charts and sample geographic maps through to assigning patterns and formats. In addition to this, the book will focus on macro programming and its advanced aspects. By the end of this book, you will be well versed in SAS programming and have the skills you need to easily handle and manage your data-related problems in SAS.
Table of Contents (17 chapters)

Section 1: SAS Basics
Section 2: Merging, Optimizing, and Descriptive Statistics
Section 3: Advanced Programming
Section 4: SQL in SAS
Section 5: Data Visualization and Reporting

Identifying duplicates using Proc SQL

The simplest way to remove duplicates in Proc SQL is by using the Distinct keyword. We will use it on the Dealership_Looped dataset, where the i column, which was used as a loop counter, has been dropped:

Proc Sql;
    /* Select Distinct * keeps only one copy of each fully identical row */
    Create Table Distinct_Dealership_Looped As
    Select Distinct *
    From Dealership_Looped;
Quit;

Using the Distinct keyword, we have correctly identified and removed the duplicates we created with the DO loops, leaving us with the original 36 records. This can be confirmed by looking at the following log:

NOTE: Table WORK.DISTINCT_DEALERSHIP_LOOPED created, with 36 rows and 6 columns.

NOTE: PROCEDURE SQL used (Total process time):
real time 1:56.01
cpu time 1:05.78
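
The Distinct keyword removes duplicates in a single pass, but it does not tell us which rows were duplicated in the first place. If we wanted to list the offending rows before removing them, a Group By with a Having clause is one way to do it. The following is a minimal sketch; the column names are placeholders rather than the actual Dealership_Looped variables, and in practice you would list every column of the dataset in both the Select and Group By clauses:

Proc Sql;
    /* Hypothetical column names -- replace with the actual
       Dealership_Looped variables */
    Select Dealership_Name, Region, Sales, Count(*) As N_Copies
    From Dealership_Looped
    Group By Dealership_Name, Region, Sales
    Having Count(*) > 1;
Quit;

Each row returned represents a combination of values that appears more than once, with N_Copies showing how many times it occurs.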

Let's find out how we would have fared in terms of runtime if we had used PROC SORT. After all, PROC SORT is the most popular...
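
For reference, a PROC SORT equivalent of the query above could look something like the following. This is a sketch rather than the book's timed code, and the Out= dataset name is made up; the NoDup option (an alias for NoDupRecs) deletes an observation whose values match the previously retained observation across all variables, so sorting by every variable first ensures identical rows end up adjacent:

Proc Sort Data=Dealership_Looped Out=Sorted_Dealership_Looped NoDup;
    By _All_;    /* sort by every variable so fully identical rows are adjacent */
Run;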