Machine Learning Quick Reference

By: Rahul Kumar

Overview of this book

Machine learning makes it possible to learn about the unknowns in your datasets and uncover hidden insights, provided you master the right tools and techniques. This book guides you through them in a compact manner. After a quick overview of what machine learning is all about, Machine Learning Quick Reference jumps right into its core algorithms and demonstrates how they can be applied to real-world scenarios. From evaluating models to optimizing their performance, this book introduces you to best practices in machine learning. You will also look at more advanced aspects, such as training neural networks and working with different kinds of data, including text, time-series, and sequential data. Advanced methods and techniques such as causal inference, deep Gaussian processes, and more are also covered. By the end of this book, you will be able to train fast, accurate machine learning models, and you will have a book you can easily use as a point of reference.

Random forest algorithm

The random forest algorithm builds on the bagging technique. Each tree in the forest is planted and grown in the following manner:

There are N observations in the training set. For each tree, a sample is drawn from these N observations at random and with replacement. This sample acts as the training set for that tree, so different trees see different samples.
If there are M input features (variables), a number m < M is chosen. At each node of the tree, m features are selected at random out of the M, and the split for that node is chosen from among those m features.
Every tree is grown to the largest extent possible, without pruning.
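
The two sources of randomness described above can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not the book's implementation. The helper names `bootstrap_indices` and `random_feature_subset` and the sizes N = 10, M = 9, m = 3 are made up for the example, and it assumes that each bootstrap sample has the same size as the training set, which is a common convention.

```python
import random


def bootstrap_indices(n_observations, rng):
    """Draw n_observations row indices at random, with replacement."""
    return [rng.randrange(n_observations) for _ in range(n_observations)]


def random_feature_subset(n_features, m, rng):
    """Pick m distinct feature indices out of n_features, with m < n_features."""
    if not 0 < m < n_features:
        raise ValueError("m must satisfy 0 < m < n_features")
    return sorted(rng.sample(range(n_features), m))


rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Hypothetical sizes: N observations, M features, m features tried per node.
N, M, m = 10, 9, 3

rows_for_tree = bootstrap_indices(N, rng)            # training rows for one tree
features_at_node = random_feature_subset(M, m, rng)  # candidates for one split

print("rows for this tree:", rows_for_tree)
print("features considered at this node:", features_at_node)
```

Because the rows are drawn with replacement, some indices usually appear more than once and others not at all, which is why each tree is trained on a slightly different view of the data. A new feature subset would be drawn at every node, not once per tree.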

Prediction is based on aggregating the results coming out of all the trees. In the case of classification, the method of aggregation is majority voting, whereas in the case of regression it is the average of all the results.
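
To make the aggregation step concrete, here is a small sketch that assumes the per-tree predictions are already available as plain Python lists. The function names and the sample predictions (including the class labels) are hypothetical and only serve the illustration.

```python
from collections import Counter
from statistics import mean


def aggregate_classification(tree_predictions):
    """Return the class label predicted by the largest number of trees."""
    # On a tie, Counter keeps the label that was encountered first.
    return Counter(tree_predictions).most_common(1)[0][0]


def aggregate_regression(tree_predictions):
    """Return the average of the numeric predictions of all trees."""
    return mean(tree_predictions)


# Hypothetical outputs of five classification trees for one observation.
class_votes = ["malignant", "benign", "malignant", "malignant", "benign"]
# Hypothetical outputs of three regression trees for one observation.
value_predictions = [2.0, 3.0, 4.0]

print(aggregate_classification(class_votes))    # malignant
print(aggregate_regression(value_predictions))  # 3.0
```

Some libraries average the class probabilities of the trees instead of counting hard votes, so the result of a real implementation can differ from this simple vote on close calls.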

Let's work through a case study, since that will help us understand this concept in more detail. We will use breast cancer data.

Case study

The data that is given in this case study is about patients who were detected...