Book Image

Machine Learning for Data Mining

By : Jesus Salcedo
Book Image

Machine Learning for Data Mining

By: Jesus Salcedo

Overview of this book

Machine learning (ML) combined with data mining can give you amazing results in your data mining work by empowering you with several ways to look at data. This book will help you improve your data mining techniques by using smart modeling techniques. This book will teach you how to implement ML algorithms and techniques in your data mining work. It will enable you to pair the best algorithms with the right tools and processes. You will learn how to identify patterns and make predictions with minimal human intervention. You will build different types of ML models, such as the neural network, the Support Vector Machines (SVMs), and the Decision tree. You will see how all of these models works and what kind of data in the dataset they are suited for. You will learn how to combine the results of different models in order to improve accuracy. Topics such as removing noise and handling errors will give you an added edge in model building and optimization. By the end of this book, you will be able to build predictive models and extract information of interest from the dataset
Table of Contents (7 chapters)

Predicting continuous outcomes

Until now, we have spent all of our time talking about categorical outcomes and most of those examples apply to continuous outcomes, but in this section we're going to focus exclusively on continuous outcome variables.

As I mentioned previously, when we're talking about continuous outcome predictions or variables, everything that we've talked about in this book still applies: the main difference, though, is going to be in terms of how we end up combining predictions.

Here, in this example, we can see that we built three models and we have predictions from each one of those models:

When we want to combine the predictions, all we do is take a mathematical average. The mean of these previous models ends up being the combined prediction because we're not predicting individual categories as we were when we had a categorical outcome...