Mastering SQL Server 2014 Data Mining

Mastering SQL Server 2014 Data Mining

By : Amarpreet Singh Bassan, Debarchan Sarkar

Buy this Book

Mastering SQL Server 2014 Data Mining

By: Amarpreet Singh Bassan, Debarchan Sarkar

Buy this Book

Overview of this book

<p>Whether you are new to data mining or are a seasoned expert, this book will provide you with the skills you need to successfully create, customize, and work with Microsoft Data Mining Suite. Starting with the basics, this book will cover how to clean the data, design the problem, and choose a data mining model that will give you the most accurate prediction.</p> <p>Next, you will be taken through the various classification models such as the decision tree data model, neural network model, as well as Naïve Bayes model. Following this, you'll learn about the clustering and association algorithms, along with the sequencing and regression algorithms, and understand the data mining expressions associated with each algorithm. With ample screenshots that offer a step-by-step account of how to build a data mining solution, this book will ensure your success with this cutting-edge data mining system.</p>

Mastering SQL Server 2014 Data Mining

Credits

About the Authors

About the Reviewers

www.PacktPub.com

Preface

Free Chapter

Identifying, Staging, and Understanding Data

Data mining life cycle

Staging data

Understanding and cleansing data

Summary

Data Model Preparation and Deployment

Preparing data models

Validating data models

Deploying data models

Summary

Tools of the Trade

SQL Server BI Suite

References

Summary

Preparing the Data

Listing of popular databases

Summary

Classification Models

Input, output, and predicted columns

The feature selection

The Microsoft Decision Tree algorithm

The Microsoft Neural Network algorithm

The Microsoft Naïve Bayes algorithm

Summary

Segmentation and Association Models

The Microsoft Clustering algorithm

The Microsoft Association algorithm

Summary

Sequence and Regression Models

The Microsoft Sequence Clustering algorithm

The Microsoft Time Series algorithm

Summary

Data Mining Using Excel and Big Data

Data mining using Microsoft Excel

Data mining using HDInsight and Microsoft Azure Machine Learning

Summary

Tuning the Models

Getting the real-world data

Adding a clustering model to the data mining structure

Adding the Neural Network model to the data mining structure

Summary

Troubleshooting

A fraction of rows get transferred into a SQL table

Error during changing of the data type of the table

Troubleshooting the data mining structure performance

Error during the deployment of a model

Summary

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

The Microsoft Clustering algorithm

The data mining models based on the Microsoft Clustering algorithm is targeted towards identifying the relationships between different entities of the dataset and dividing them into logically related groups. This algorithm differs from other algorithms in such a way that these do not require any predictable columns as their prime motive is to identify the groups of data, rather than to predict the value of an attribute. These groupings can then be used to make predictions, identify exceptions, and so on. Thus, the prime usage of this algorithm lies mainly in the data analysis phase where the focus is mainly on the existing/current data to test our hypothesis about the relationships between entities in the data and determine any exceptions (hidden relationships).

The following screenshot shows a data mining model based on the Microsoft Clustering algorithm. This can be seen in the SSDT Mining Models tab.

An important observation regarding the preceding screenshot...

Mastering SQL Server 2014 Data Mining

By : Amarpreet Singh Bassan, Debarchan Sarkar

Mastering SQL Server 2014 Data Mining

By: Amarpreet Singh Bassan, Debarchan Sarkar

Overview of this book

Related Content you might be interested in

Current Title:

Mastering SQL Server 2014 Data Mining

The Microsoft Clustering algorithm