Book Image

Hands-On SAS for Data Analysis

By : Harish Gulati
Book Image

Hands-On SAS for Data Analysis

By: Harish Gulati

Overview of this book

SAS is one of the leading enterprise tools in the world today when it comes to data management and analysis. It enables the fast and easy processing of data and helps you gain valuable business insights for effective decision-making. This book will serve as a comprehensive guide that will prepare you for the SAS certification exam. After a quick overview of the SAS architecture and components, the book will take you through the different approaches to importing and reading data from different sources using SAS. You will then cover SAS Base and 4GL, understanding data management and analysis, along with exploring SAS functions for data manipulation and transformation. Next, you'll discover SQL procedures and get up to speed on creating and validating queries. In the concluding chapters, you'll learn all about data visualization, right from creating bar charts and sample geographic maps through to assigning patterns and formats. In addition to this, the book will focus on macro programming and its advanced aspects. By the end of this book, you will be well versed in SAS programming and have the skills you need to easily handle and manage your data-related problems in SAS.
Table of Contents (17 chapters)
Free Chapter
1
Section 1: SAS Basics
4
Section 2: Merging, Optimizing, and Descriptive Statistics
7
Section 3: Advanced Programming
10
Section 4: SQL in SAS
13
Section 5: Data Visualization and Reporting

Proc Corr

Correlation can be defined as the linear relationship between multiple variables. Positive, negative, and no correlation are the three scenarios that can exist. The values range from -1 to 1. A positive correlation is observed when the value of one variable increases with the increase of another variable, whereas in negative correlation, with an increase in the value of one variable, the value of another variable decreases.

Correlation is often run as a precursor to evaluating the role of variables in models such as regression. In the following instance, we have multiple variables in the Var statement that are being assessed as predictors for the Stock variable. The Date variable is an ID variable and is not being assessed:

Proc Corr Data = Model;
ID Date;
With Stock;
Var Basket_Index -- M1_Money_Supply_Index;
Run;

The Corr procedure produces basic statistical measures similar...