Book Image

IBM SPSS Modeler Cookbook

By : Keith McCormick, Abbott
Book Image

IBM SPSS Modeler Cookbook

By: Keith McCormick, Abbott

Overview of this book

IBM SPSS Modeler is a data mining workbench that enables you to explore data, identify important relationships that you can leverage, and build predictive models quickly allowing your organization to base its decisions on hard data not hunches or guesswork. IBM SPSS Modeler Cookbook takes you beyond the basics and shares the tips, the timesavers, and the workarounds that experts use to increase productivity and extract maximum value from data. The authors of this book are among the very best of these exponents, gurus who, in their brilliant and imaginative use of the tool, have pushed back the boundaries of applied analytics. By reading this book, you are learning from practitioners who have helped define the state of the art. Follow the industry standard data mining process, gaining new skills at each stage, from loading data to integrating results into everyday business practices. Get a handle on the most efficient ways of extracting data from your own sources, preparing it for exploration and modeling. Master the best methods for building models that will perform well in the workplace. Go beyond the basics and get the full power of your data mining workbench with this practical guide.
Table of Contents (11 chapters)
10
Index

Creating flag variables for aggregation

The SetToFlag node is a very convenient node that converts a single nominal variable into as many binary columns as desired, one column for each nominal variable value. However, the default values for the node are T and F, which unfortunately cannot be used for any nodes that require numeric values. In this recipe we will create flag variables that can be used in Aggregate nodes, Means nodes, and other numeric operations. Using numeric values (1 and 0 in this recipe) will work with any nodes that require flag or nominal values such as Association Rules and the grouping variable for the Means node (as T and F will), but will also work as numeric values in nodes such as the Aggregate node.

Getting ready

This recipe uses the datafile cup98lrn_reduced_vars3.sav and the stream recipe_variableconstruct_flags.str.

How to do it...

  1. Open the stream recipe_variableconstruct_flags.str by clicking on File | Open Stream.
  2. Make sure the datafile points to the correct...