Machine Learning with R - Fourth Edition

By: Brett Lantz

Overview of this book

Machine Learning with R, Fourth Edition, is a data science guide to machine learning (ML) in R. It takes you through classification methods such as nearest neighbors and Naive Bayes, and regression modeling from simple linear to logistic. You’ll explore neural networks and support vector machines, along with practical applications of deep learning, uncover insights in complex datasets with market basket analysis, and find hidden patterns in your data using k-means clustering.

With three new chapters on data, you’ll hone your skills in advanced data preparation and feature engineering, and learn to handle high-dimensional, sparse, and imbalanced data. You’ll also work with big data, using parallel computing and GPU resources for faster results, and evaluate model performance with measures that go beyond accuracy.

A new chapter on building better learners covers techniques that top teams use to improve model performance, including ensemble methods, model stacking, and blending.

Preface

Who this book is for

What this book covers

What you need for this book

Get in touch

Introducing Machine Learning

The origins of machine learning

Uses and abuses of machine learning

How machines learn

Machine learning in practice

Machine learning with R

Summary

Managing and Understanding Data

R data structures

Managing data with R

Exploring and understanding data

Summary

Lazy Learning – Classification Using Nearest Neighbors

Understanding nearest neighbor classification

Example – diagnosing breast cancer with the k-NN algorithm

Summary

Probabilistic Learning – Classification Using Naive Bayes

Understanding Naive Bayes

Example – filtering mobile phone spam with the Naive Bayes algorithm

Summary

Divide and Conquer – Classification Using Decision Trees and Rules

Understanding decision trees

Example – identifying risky bank loans using C5.0 decision trees

Understanding classification rules

Example – identifying poisonous mushrooms with rule learners

Summary

Forecasting Numeric Data – Regression Methods

Understanding regression

Example – predicting auto insurance claims costs using linear regression

Understanding regression trees and model trees

Example – estimating the quality of wines with regression trees and model trees

Summary

Black-Box Methods – Neural Networks and Support Vector Machines

Understanding neural networks

Example – modeling the strength of concrete with ANNs

Understanding support vector machines

Example – performing OCR with SVMs

Summary

Finding Patterns – Market Basket Analysis Using Association Rules

Understanding association rules

Example – identifying frequently purchased groceries with association rules

Summary

Finding Groups of Data – Clustering with k-means

Understanding clustering

Finding teen market segments using k-means clustering

Summary

Evaluating Model Performance

Measuring performance for classification

Estimating future performance

Summary

Being Successful with Machine Learning

What makes a successful machine learning practitioner?

What makes a successful machine learning model?

Putting the “science” in data science

Summary

Advanced Data Preparation

Performing feature engineering

Feature engineering in practice

Exploring R’s tidyverse

Summary

Challenging Data – Too Much, Too Little, Too Complex

The challenge of high-dimension data

Making use of sparse data

Handling missing data

The problem of imbalanced data

Summary

Building Better Learners

Tuning stock models for better performance

Improving model performance with ensembles

Stacking models for meta-learning

Summary

Making Use of Big Data

Practical applications of deep learning

Unsupervised learning and big data

Adapting R to handle large datasets

Summary

Other Books You May Enjoy

Index

What this book covers

Chapter 1, Introducing Machine Learning, presents the terminology and concepts that define and distinguish machine learners, as well as a method for matching a learning task with the appropriate algorithm.

Chapter 2, Managing and Understanding Data, provides an opportunity to get your hands dirty working with data in R. Essential data structures and procedures used for loading, exploring, and understanding data are discussed.

Chapter 3, Lazy Learning – Classification Using Nearest Neighbors, teaches you how to understand and apply a simple yet powerful machine learning algorithm to your first real-world task: identifying malignant samples of cancer.

Chapter 4, Probabilistic Learning – Classification Using Naive Bayes, reveals the essential concepts of probability that are used in cutting-edge spam filtering systems. You’ll learn the basics of text mining in the process of building your own spam filter.

Chapter 5, Divide and Conquer – Classification Using Decision Trees and Rules, explores a couple of learning algorithms whose predictions are not only accurate, but also easily explained. We’ll apply these methods to tasks where transparency is important.

Chapter 6, Forecasting Numeric Data – Regression Methods, introduces machine learning algorithms used for making numeric predictions. As these techniques are heavily embedded in the field of statistics, you will also learn the essential metrics needed to make sense of numeric relationships.

Chapter 7, Black-Box Methods – Neural Networks and Support Vector Machines, covers two complex but powerful machine learning algorithms. Though the math may appear intimidating, we will work through examples that illustrate their inner workings in simple terms.

Chapter 8, Finding Patterns – Market Basket Analysis Using Association Rules, exposes the algorithm used in the recommendation systems employed by many retailers. If you’ve ever wondered how retailers seem to know your purchasing habits better than you know yourself, this chapter will reveal their secrets.

Chapter 9, Finding Groups of Data – Clustering with k-means, is devoted to a procedure that locates clusters of related items. We’ll utilize this algorithm to identify profiles within an online community.

Chapter 10, Evaluating Model Performance, provides information on measuring the success of a machine learning project and obtaining a reliable estimate of the learner’s performance on future data.

Chapter 11, Being Successful with Machine Learning, describes the common pitfalls faced when transitioning from textbook datasets to real-world machine learning problems, as well as the tools, strategies, and soft skills needed to combat these issues.

Chapter 12, Advanced Data Preparation, introduces the set of “tidyverse” packages, which help wrangle large datasets to extract meaningful information to aid the machine learning process.

Chapter 13, Challenging Data – Too Much, Too Little, Too Complex, considers solutions to a common set of problems that can derail a machine learning project when the useful information is lost within a massive dataset, much like a needle in a haystack.

Chapter 14, Building Better Learners, reveals the methods employed by the teams at the top of machine learning competition leaderboards. If you have a competitive streak, or simply want to get the most out of your data, you’ll need to add these techniques to your repertoire.

Chapter 15, Making Use of Big Data, explores the frontiers of machine learning. From working with extremely large datasets to making R work faster, the topics covered will help you push the boundaries of what is possible with R, and even allow you to utilize the sophisticated tools developed by large organizations like Google for image recognition and understanding text data.
