R Machine Learning Essentials

Book Image

R Machine Learning Essentials

By : Michele Usuelli

Book Image

R Machine Learning Essentials

By: Michele Usuelli

Overview of this book

R Machine Learning Essentials

R Machine Learning Essentials

Credits

About the Author

About the Author

About the Reviewers

About the Reviewers

www.PacktPub.com

www.PacktPub.com

Preface

Free Chapter

Transforming Data into Actions

Transforming Data into Actions

A data-driven approach in business decisions

Identifying hidden patterns

Estimating the impact of an action

R – A Powerful Tool for Developing Machine Learning Algorithms

R – A Powerful Tool for Developing Machine Learning Algorithms

Some useful R packages

A Simple Machine Learning Analysis

A Simple Machine Learning Analysis

Exploring data interactively

Exploring the data using machine learning models

Predicting newer outcomes

Step 1 – Data Exploration and Feature Engineering

Step 1 – Data Exploration and Feature Engineering

Building a machine learning solution

Building the feature data

Exploring and visualizing the features

Modifying the features

Ranking the features using a filter or a dimensionality reduction

Step 2 – Applying Machine Learning Techniques

Step 2 – Applying Machine Learning Techniques

Identifying a homogeneous group of items

Applying the k-nearest neighbor algorithm

Optimizing the k-nearest neighbor algorithm

Step 3 – Validating the Results

Step 3 – Validating the Results

Validating a machine learning model

Tuning the parameters

Selecting the data features to include in the model

Tuning features and parameters together

Overview of Machine Learning Techniques

Overview of Machine Learning Techniques

Supervised learning

Linear regression

Unsupervised learning

Machine Learning Examples Applicable to Businesses

Machine Learning Examples Applicable to Businesses

Overview of the problem

Clustering the clients

Predicting the output

Index

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Building the feature data

This section shows how we can structure the raw data to build the features. For each country, the data is:

A picture of the flag
Some geographical data such as continent, geographic quadrant, area, and population
The language and religion of the country

The target is to build a model that predicts a country language starting from its flag. Most of the models can deal with numeric and/or categorical data, so we can't use the image of the flag as a feature for the model. The solution is to define some features, for instance the number of colors, that describe each flag. In this way, we start from a table whose rows correspond to the countries and whose columns correspond to the flag features.

It would take a lot of time to build the matrix with the flag attributes based on the pictures. Fortunately, we can use a dataset that contains some features. The data that we have is still a bit messy, so we need to clean and transform it to build a feature table in the right format...