Book Image

Data Analysis with IBM SPSS Statistics

By : Ken Stehlik-Barry, Anthony Babinec
Book Image

Data Analysis with IBM SPSS Statistics

By: Ken Stehlik-Barry, Anthony Babinec

Overview of this book

SPSS Statistics is a software package used for logical batched and non-batched statistical analysis. Analytical tools such as SPSS can readily provide even a novice user with an overwhelming amount of information and a broad range of options for analyzing patterns in the data. The journey starts with installing and configuring SPSS Statistics for first use and exploring the data to understand its potential (as well as its limitations). Use the right statistical analysis technique such as regression, classification and more, and analyze your data in the best possible manner. Work with graphs and charts to visualize your findings. With this information in hand, the discovery of patterns within the data can be undertaken. Finally, the high level objective of developing predictive models that can be applied to other situations will be addressed. By the end of this book, you will have a firm understanding of the various statistical analysis techniques offered by SPSS Statistics, and be able to master its use for data analysis with ease.
Table of Contents (17 chapters)
4
Dealing with Missing Data and Outliers
10
Crosstabulation Patterns for Categorical Data

Twostep cluster analysis example

For this example, we return to the USA states violent crime data example. Recall that TWOSTEP CLUSTER offers an automatic method for selecting the number of clusters, as well as a Likelihood distance measure. We will run it to show some of the visuals in the model viewer output.

The approach here is to:

  1. First run TWOSTEP CLUSTER in automatic mode to identify a tentative number of clusters.
  2. Then run TWOSTEP CLUSTER again with a specified number of clusters.

Here is the SPSS code for the first run:

TWOSTEP CLUSTER
/CONTINUOUS VARIABLES=MurderR RRapeR RobberyR AssaultR BurglaryR LarcenyR VehicleTheftR
/DISTANCE Likelihood
/NUMCLUSTERS AUTO 15 BIC
/HANDLENOISE 0
/MEMALLOCATE 64
/CRITERIA INITHRESHOLD(0) MXBRANCH(8) MXLEVEL(3)
/VIEWMODEL DISPLAY=YES
/PRINT IC COUNT SUMMARY.

Here are comments on the SPSS code:

  • In a step not shown, the variable names...