Predictive Analytics Using Rattle and Qlik Sense

Book Image

Predictive Analytics Using Rattle and Qlik Sense

By : Ferran Garcia Pagans, Fernando G Pagans

Book Image

Predictive Analytics Using Rattle and Qlik Sense

By: Ferran Garcia Pagans, Fernando G Pagans

Overview of this book

Predictive Analytics Using Rattle and Qlik Sense

Predictive Analytics Using Rattle and Qlik Sense

Credits

About the Author

About the Author

About the Reviewers

About the Reviewers

www.PacktPub.com

www.PacktPub.com

Preface

Free Chapter

Getting Ready with Predictive Analytics

Getting Ready with Predictive Analytics

Analytics, predictive analytics, and data visualization

Purpose of the book

Introducing R, Rattle, and Qlik Sense Desktop

Installing the environment

Downloading and installing Rattle

Installing Qlik Sense Desktop

Exploring Qlik Sense Desktop

Further learning

Preparing Your Data

Preparing Your Data

Datasets, observations, and variables

Transforming data

Further learning

Exploring and Understanding Your Data

Exploring and Understanding Your Data

Visualizing distributions

Correlations among input variables

Further learning

Creating Your First Qlik Sense Application

Creating Your First Qlik Sense Application

Customer segmentation and customer buying behavior

Loading data and creating a data model

Creating a simple data app

Associative logic

Creating charts

Analyzing your data

Further learning

Clustering and Other Unsupervised Learning Methods

Clustering and Other Unsupervised Learning Methods

Machine learning – unsupervised and supervised learning

Further learning

Decision Trees and Other Supervised Learning Methods

Decision Trees and Other Supervised Learning Methods

Partitioning datasets and model optimization

Decision Tree Learning

Entropy and information gain

Underfitting and overfitting

Using a Decision Tree to classify credit risks

Ensemble classifiers

Further learning

Model Evaluation

Model Evaluation

Cross-validation

Regression performance

Measuring the performance of classifiers

Further learning

Visualizations, Data Applications, Dashboards, and Data Storytelling

Visualizations, Data Applications, Dashboards, and Data Storytelling

Data visualization in Qlik Sense

Data analysis, data applications, and dashboards

Data storytelling with Qlik Sense

Further learning

Developing a Complete Application

Developing a Complete Application

Understanding the bike rental problem

Exploring the data with Qlik Sense

Creating a Qlik Sense App to control the activity

Using Rattle to forecast the demand

Model evaluation

Further learning

Index

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Partitioning datasets and model optimization

As we've explained, in supervised learning, we split the dataset in three subsets—training, validation, and testing:

To create the model or learner, Rattle uses the training dataset. After creating a model, we use the validation data to evaluate its performance. To improve the performance, depending on the algorithm we're using, we can use different tuning options. After tuning, we rebuild the model and evaluate its performance again. This is an iterative process; we create the model and evaluate it until we're fine with its performance.

For simplicity, in this chapter, we'll see only model creation, and in the following chapter, we'll see model optimization, but in real life, this is an iterative process.

The examples in this chapter will not have any optimization.

Finally, when you're happy with the model, you can use the testing dataset to confirm its performance. You need to use the testing dataset because you've used the validation dataset to...