IBM SPSS Modeler Essentials

IBM SPSS Modeler Essentials

By : Jesus Salcedo, Keith McCormick

Buy this Book

IBM SPSS Modeler Essentials

By: Jesus Salcedo, Keith McCormick

Buy this Book

Overview of this book

IBM SPSS Modeler allows users to quickly and efficiently use predictive analytics and gain insights from your data. With almost 25 years of history, Modeler is the most established and comprehensive Data Mining workbench available. Since it is popular in corporate settings, widely available in university settings, and highly compatible with all the latest technologies, it is the perfect way to start your Data Science and Machine Learning journey. This book takes a detailed, step-by-step approach to introducing data mining using the de facto standard process, CRISP-DM, and Modeler’s easy to learn “visual programming” style. You will learn how to read data into Modeler, assess data quality, prepare your data for modeling, find interesting patterns and relationships within your data, and export your predictions. Using a single case study throughout, this intentionally short and focused book sticks to the essentials. The authors have drawn upon their decades of teaching thousands of new users, to choose those aspects of Modeler that you should learn first, so that you get off to a good start using proven best practices. This book provides an overview of various popular data modeling techniques and presents a detailed case study of how to use CHAID, a decision tree model. Assessing a model’s performance is as important as building it; this book will also show you how to do that. Finally, you will see how you can score new data and export your predictions. By the end of this book, you will have a firm understanding of the basics of data mining and how to effectively use Modeler to build predictive models.

Title Page

Credits

About the Authors

About the Reviewer

www.PacktPub.com

Customer Feedback

Dedication

Preface

Free Chapter

Introduction to Data Mining and Predictive Analytics

Introduction to data mining

CRISP-DM overview

The data mining process (as a case study)

Summary

The Basics of Using IBM SPSS Modeler

Introducing the Modeler graphic user interface

Building streams

Modeler stream rules

Help options

Summary

Importing Data into Modeler

Data structure

Levels of measurement and roles

Summary

Data Quality and Exploration

Data Audit node options

Summary

Cleaning and Selecting Data

Selecting cases

Sorting cases

Identifying and removing duplicate cases

Reclassifying categorical values

Summary

Combining Data Files

Combining data files with the Append node

Removing fields with the Filter node

Combining data files with the Merge node

Summary

Deriving New Fields

Summary

Looking for Relationships Between Fields

Relationships between categorical fields

Relationships between categorical and continuous fields

Relationships between continuous fields

Summary

Introduction to Modeling Options in IBM SPSS Modeler

Classification

Association

Segmentation

Summary

Decision Tree Models

Decision tree theory

CHAID theory

CHAID results

Model Assessment and Scoring

Contrasting model assessment with the Evaluation phase

Summary

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Summary

This chapter introduced the Derive node. The Derive node can perform many different types of calculations so that users can extract more information from the data. These additional fields can then provide insight that may not have been apparent. In this chapter you learned that the Derive node can create fields as formulas, flags, nominals, or conditionals.

In the next chapter, we will continue to explore our data by discovering simple relationships between outcome fields and predictor fields. Specifically, readers will learn to use several statistical and graphing nodes to determine which fields are related to an outcome variable.

IBM SPSS Modeler Essentials

By : Jesus Salcedo, Keith McCormick

IBM SPSS Modeler Essentials

By: Jesus Salcedo, Keith McCormick

Overview of this book

Related Content you might be interested in

Current Title:

IBM SPSS Modeler Essentials

Machine Learning for Data Mining

Data Analysis with IBM SPSS Statistics

Advanced Analytics with R and Tableau

Summary