Book Image

IBM SPSS Modeler Cookbook

Book Image

IBM SPSS Modeler Cookbook

Overview of this book

IBM SPSS Modeler is a data mining workbench that enables you to explore data, identify important relationships that you can leverage, and build predictive models quickly allowing your organization to base its decisions on hard data not hunches or guesswork. IBM SPSS Modeler Cookbook takes you beyond the basics and shares the tips, the timesavers, and the workarounds that experts use to increase productivity and extract maximum value from data. The authors of this book are among the very best of these exponents, gurus who, in their brilliant and imaginative use of the tool, have pushed back the boundaries of applied analytics. By reading this book, you are learning from practitioners who have helped define the state of the art. Follow the industry standard data mining process, gaining new skills at each stage, from loading data to integrating results into everyday business practices. Get a handle on the most efficient ways of extracting data from your own sources, preparing it for exploration and modeling. Master the best methods for building models that will perform well in the workplace. Go beyond the basics and get the full power of your data mining workbench with this practical guide.
Table of Contents (11 chapters)
10
Index

Cartesian product merge using key-less merge by key


Preparing data for analysis requires a wide range of different operations, because each different kind of analysis requires the data to be in the appropriate form for that analysis. In some examples, two or more lists of items must be joined together in such a way that the result is every possible combination of items, one from each of the lists. This is called a Cartesian product, and in Modeler this is performed using a merge by key operations where no key is specified.

Getting ready

This recipe requires no datafile because the example data is generated by user input source nodes and the stream file required is Cartesian_Product.str

How to do it...

To perform a Cartesian product merge where no key is specified:

  1. Open the stream Cartesian_Product.str by navigating to File | Open Stream.

  2. Run the four Table nodes to the left, ABC, PQR, XYZ, and 123. This will display the four data sets, generated by the user input source nodes, that will be used...