Hands-On SAS for Data Analysis

By: Harish Gulati

Overview of this book

SAS is one of the leading enterprise tools in the world today when it comes to data management and analysis. It enables the fast and easy processing of data and helps you gain valuable business insights for effective decision-making. This book will serve as a comprehensive guide that will prepare you for the SAS certification exam. After a quick overview of the SAS architecture and components, the book will take you through the different approaches to importing and reading data from different sources using SAS. You will then cover SAS Base and 4GL, understanding data management and analysis, along with exploring SAS functions for data manipulation and transformation. Next, you'll discover SQL procedures and get up to speed on creating and validating queries. In the concluding chapters, you'll learn all about data visualization, right from creating bar charts and sample geographic maps through to assigning patterns and formats. In addition to this, the book will focus on macro programming and its advanced aspects. By the end of this book, you will be well versed in SAS programming and have the skills you need to easily handle and manage your data-related problems in SAS.
Table of Contents (17 chapters)
Section 1: SAS Basics
Section 2: Merging, Optimizing, and Descriptive Statistics
Section 3: Advanced Programming
Section 4: SQL in SAS
Section 5: Data Visualization and Reporting

Proc SQL joins

While introducing you to SAS programming, we looked at the structure of a Proc SQL query. Let's now look at the various join options in SAS. We will start by creating a basic join without imposing any conditions.

We will use the Class dataset from the previous examples and the following Grade dataset:

Data Grade;
Input ClassID $ Year Grade $;
Datalines;
A1234 2013 A
A2323 2013 A
B3423 2013 B
B5324 2013 C
C2342 2013 C
D3242 2013 D
A1234 2019 B
A2323 2019 C
B3423 2019 D
B5324 2019 B
C2342 2019 C
D3242 2019 D
;
Run;

The following code will produce a dataset that contains the yearly grades of the students found in the Class dataset:

Proc SQL;
Create Table Class_Grade As
Select *
From Class, Grade
;
Quit;
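
Because the preceding query has no Where or On condition, Proc SQL pairs every row of Class with every row of Grade (a Cartesian product). A minimal sketch of how this could be checked, assuming the Class, Grade, and Class_Grade tables all exist in the Work library; the column aliases are illustrative only:

```sas
/* Sketch: compare row counts to confirm a Cartesian product.  */
/* Assumes Class, Grade, and Class_Grade exist in Work.        */
Proc SQL;
  /* Rows in the Class dataset */
  Select Count(*) As Class_Rows From Class;
  /* Rows in the Grade dataset (12 datalines above) */
  Select Count(*) As Grade_Rows From Grade;
  /* Rows in the joined table; expected Class_Rows * Grade_Rows */
  Select Count(*) As Joined_Rows From Class_Grade;
Quit;
```

If Joined_Rows equals Class_Rows multiplied by Grade_Rows, each student has been paired with every grade record, not only their own.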

Once the Class_Grade table has been created, the key to understanding joins doesn't lie in just exploring the results. Instead, first let's...