Mastering SQL Server 2014 Data Mining

Mastering SQL Server 2014 Data Mining

By : Amarpreet Singh Bassan, Debarchan Sarkar

Buy this Book

Mastering SQL Server 2014 Data Mining

By: Amarpreet Singh Bassan, Debarchan Sarkar

Buy this Book

Overview of this book

<p>Whether you are new to data mining or are a seasoned expert, this book will provide you with the skills you need to successfully create, customize, and work with Microsoft Data Mining Suite. Starting with the basics, this book will cover how to clean the data, design the problem, and choose a data mining model that will give you the most accurate prediction.</p> <p>Next, you will be taken through the various classification models such as the decision tree data model, neural network model, as well as Naïve Bayes model. Following this, you'll learn about the clustering and association algorithms, along with the sequencing and regression algorithms, and understand the data mining expressions associated with each algorithm. With ample screenshots that offer a step-by-step account of how to build a data mining solution, this book will ensure your success with this cutting-edge data mining system.</p>

Mastering SQL Server 2014 Data Mining

Credits

About the Authors

About the Reviewers

www.PacktPub.com

Preface

Free Chapter

Identifying, Staging, and Understanding Data

Data mining life cycle

Staging data

Understanding and cleansing data

Summary

Data Model Preparation and Deployment

Preparing data models

Validating data models

Deploying data models

Summary

Tools of the Trade

SQL Server BI Suite

References

Summary

Preparing the Data

Listing of popular databases

Summary

Classification Models

Input, output, and predicted columns

The feature selection

The Microsoft Decision Tree algorithm

The Microsoft Neural Network algorithm

The Microsoft Naïve Bayes algorithm

Summary

Segmentation and Association Models

The Microsoft Clustering algorithm

The Microsoft Association algorithm

Summary

Sequence and Regression Models

The Microsoft Sequence Clustering algorithm

The Microsoft Time Series algorithm

Summary

Data Mining Using Excel and Big Data

Data mining using Microsoft Excel

Data mining using HDInsight and Microsoft Azure Machine Learning

Summary

Tuning the Models

Getting the real-world data

Adding a clustering model to the data mining structure

Adding the Neural Network model to the data mining structure

Summary

Troubleshooting

A fraction of rows get transferred into a SQL table

Error during changing of the data type of the table

Troubleshooting the data mining structure performance

Error during the deployment of a model

Summary

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Troubleshooting the data mining structure performance

There can be many reasons why this might happen; in our case, if we make all the columns as input and try to generate the data mining structure, it will definitely take a long time because there will be a large number of trees or clusters to be generated depending on the algorithm that we are using. We will discuss a few of the commonly used algorithms and the parameters that can be altered to improve the processing performance.

The Decision Tree algorithm

The information that is required to classify data will proportionately increase with the increase in the input values. Therefore, there is a need to optimize the performance. The performance can be optimized by the following aspects:

Reducing the number of inputs
While grouping the items into bins, group only those values that provide the maximum information

We want to reduce the tree growth while trying not to lose the consistency and accuracy of the model. The following parameters help...

Mastering SQL Server 2014 Data Mining

By : Amarpreet Singh Bassan, Debarchan Sarkar

Mastering SQL Server 2014 Data Mining

By: Amarpreet Singh Bassan, Debarchan Sarkar

Overview of this book

Related Content you might be interested in

Current Title:

Mastering SQL Server 2014 Data Mining

Troubleshooting the data mining structure performance

The Decision Tree algorithm