Machine Learning with R Quick Start Guide

By : Iván Pastor Sanz

Machine Learning with R Quick Start Guide

By: Iván Pastor Sanz

Overview of this book

Machine Learning with R Quick Start Guide takes you on a data-driven journey that starts with the very basics of R and machine learning. It gradually builds upon core concepts so you can handle the varied complexities of data and understand each stage of the machine learning pipeline. From data collection to implementing Natural Language Processing (NLP), this book covers it all. You will implement key machine learning algorithms to understand how they are used to build smart models. You will cover tasks such as clustering, logistic regressions, random forests, support vector machines, and more. Furthermore, you will also look at more advanced aspects such as training neural networks and topic modeling. By the end of the book, you will be able to apply the concepts of machine learning, deal with data-related problems, and solve them using the powerful yet simple language that is R.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

R Fundamentals for Machine Learning

R and RStudio installation

Some basic commands

Objects, special cases, and basic operators in R

Controlling code flow

All about R packages

Taking further steps

Summary

Predicting Failures of Banks - Data Collection

Collecting financial data

Collecting the target variable

Structuring data

Summary

Predicting Failures of Banks - Descriptive Analysis

Data overview

Implementing descriptive analysis

Summary

Predicting Failures of Banks - Univariate Analysis

Feature selection algorithm

Filter methods

Wrapper methods

Embedded methods

Dimensionality reduction

Summary

Predicting Failures of Banks - Multivariate Analysis

Logistic regression

Regularized methods

Testing a random forest model

Gradient boosting

Deep learning in neural networks

Support vector machines

Ensembles

Automatic machine learning

Summary

Visualizing Economic Problems in the European Union

A general overview of economic problems in countries

Clustering countries based on macroeconomic imbalances

Summary

Sovereign Crisis - NLP and Topic Modeling

Predicting country ratings using macroeconomic information

Implementing decision trees

Predicting sovereign ratings using European country reports

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Predicting country ratings using macroeconomic information

In our clustering model, discussed in Chapter 6, Visualizing Economic Problems in the European Union, using self-organizing maps, all the available data was used. Now, in order to train a model to be able to predict sovereign ratings, we need to split the data into two samples: train and test.

That's not new for us. When we tried to develop different models to predict a bank's failures, we used the caTools package to split the data, while considering our target variable.

The same procedure is used again here:

library(caTools)
 
index = sample.split(macroeconomic_data$RatingMayT1, SplitRatio = .75)
 
train_macro<-subset(macroeconomic_data, index == TRUE)
test_macro<-subset(macroeconomic_data, index == FALSE)

Now, you can print the following statements:

print(paste("The number of observations in the train...

Machine Learning with R Quick Start Guide

By : Iván Pastor Sanz

Machine Learning with R Quick Start Guide

By: Iván Pastor Sanz

Overview of this book

Related Content you might be interested in

Current Title:

Machine Learning with R Quick Start Guide

Mastering Machine Learning with R

Machine Learning with R Cookbook

Ensemble Machine Learning Cookbook