Book Image

Mastering SQL Server 2014 Data Mining

By : Amarpreet Singh Bassan, Debarchan Sarkar
Book Image

Mastering SQL Server 2014 Data Mining

By: Amarpreet Singh Bassan, Debarchan Sarkar

Overview of this book

<p>Whether you are new to data mining or are a seasoned expert, this book will provide you with the skills you need to successfully create, customize, and work with Microsoft Data Mining Suite. Starting with the basics, this book will cover how to clean the data, design the problem, and choose a data mining model that will give you the most accurate prediction.</p> <p>Next, you will be taken through the various classification models such as the decision tree data model, neural network model, as well as Naïve Bayes model. Following this, you'll learn about the clustering and association algorithms, along with the sequencing and regression algorithms, and understand the data mining expressions associated with each algorithm. With ample screenshots that offer a step-by-step account of how to build a data mining solution, this book will ensure your success with this cutting-edge data mining system.</p>
Table of Contents (17 chapters)
Mastering SQL Server 2014 Data Mining
Credits
About the Authors
About the Reviewers
www.PacktPub.com
Preface
Index

Chapter 8. Data Mining Using Excel and Big Data

In the previous chapters, we used various data mining algorithms to find the data that we were looking for. With Microsoft Excel becoming more and more powerful in terms of performing data analysis and having connectivity capabilities to many data sources, an addition of the data mining capabilities is only a logical enhancement to its capabilities.

The state of data today is aptly described by volume, variety, and velocity. Volume describes the quantity of data generated, variety describes the type of data that is generated from different sources, and velocity is the rate at which the data gets generated. The most common examples are social media platforms such as Facebook, Twitter, and so on, where we have posts and tweets from millions of people on a daily basis. These comprise of photos, audio files, video files, texts, and so on. These posts and tweets take some several terabytes of space in the data centers of Facebook and Twitter. Traditional...