Mastering Predictive Analytics with R

About the Author

Acknowledgments

About the Reviewers

www.PacktPub.com

Gearing Up for Predictive Modeling

Models

Types of models

The process of predictive modeling

Performance metrics

Summary

Linear Regression

Introduction to linear regression

Simple linear regression

Multiple linear regression

Assessing linear regression models

Problems with linear regression

Feature selection

Regularization

Summary

Logistic Regression

Classifying with linear regression

Introduction to logistic regression

Predicting heart disease

Assessing logistic regression models

Regularization with the lasso

Classification metrics

Extensions of the binary logistic classifier

Summary

Neural Networks

The biological neuron

The artificial neuron

Stochastic gradient descent

Multilayer perceptron networks

Predicting the energy efficiency of buildings

Predicting glass type revisited

Predicting handwritten digits

Summary

Support Vector Machines

Maximal margin classification

Support vector classification

Kernels and support vector machines

Predicting chemical biodegradation

Cross-validation

Predicting credit scores

Multiclass classification with support vector machines

Summary

Tree-based Methods

The intuition for tree models

Algorithms for training decision trees

Predicting class membership on synthetic 2D data

Predicting the authenticity of banknotes

Predicting complex skill learning

Summary

Ensemble Methods

Bagging

Boosting

Predicting atmospheric gamma ray radiation

Predicting complex skill learning with boosting

Random forests

Summary

Probabilistic Graphical Models

A little graph theory

Bayes' Theorem

Conditional independence

Bayesian networks

The Naïve Bayes classifier

Hidden Markov models

Predicting promoter gene sequences

Predicting letter patterns in English words

Summary

Time Series Analysis

Fundamental concepts of time series

Some fundamental time series

Stationarity

Stationary time series models

Non-stationary time series models

Predicting intense earthquakes

Predicting lynx trappings

Predicting foreign exchange rates

Other time series models

Summary

Topic Modeling

An overview of topic modeling

Latent Dirichlet Allocation

Modeling the topics of online news stories

Summary

Recommendation Systems

Rating matrix

Collaborative filtering

Singular value decomposition

R and Big Data

Predicting recommendations for movies and jokes

Loading and preprocessing the data

Exploring the data

Other approaches to recommendation systems

Summary

Index

A

ACF function
- about / Time series summary functions
activators
- about / The biological neuron
acyclic graph
- about / A little graph theory
AdaBoost / AdaBoost
AdaBoost, for binary classification
- inputs / AdaBoost
- output / AdaBoost
- observations / AdaBoost
adaptive boosting
- about / AdaBoost
additive smoothing
- about / Predicting the sentiment of movie reviews
Akaike Information Criterion (AIC)
- about / Comparing different regression models
algorithms
- building, to train decision trees / Algorithms for training decision trees
analysis of variance
- about / Significance tests for linear regression
ARCH models / Autoregressive conditional heteroscedasticity models
ARIMA models / Autoregressive integrated moving average models
ARMA model / Autoregressive moving average models
artificial neural networks (ANNs)
- about / The biological neuron
artificial neuron
- about / The artificial neuron
atmospheric gamma ray radiation
- predicting / Predicting atmospheric gamma ray radiation
Augmented Dickey-Fuller (ADF) test
- about / Autoregressive integrated moving average models
authenticity, of banknotes
- predicting / Predicting the authenticity of banknotes
author-topic model
- about / LDA extensions
autocorrelation function
- about / Time series summary functions
autocovariance function
- about / Time series summary functions
autoregressive models (AR) / Autoregressive models
axon
- about / The biological neuron
axon terminals
- about / The biological neuron

B

backpropagation algorithm
- about / Training multilayer perceptron networks
backward elimination
- about / Feature selection
backward selection
- about / Feature selection
bagging
- about / Bagging
- margin / Margins and out-of-bag observations
- out-of-bag observations / Margins and out-of-bag observations
- complex skill learning, predicting with / Predicting complex skill learning with bagging
- heart disease, predicting with / Predicting heart disease with bagging
- limitations / Limitations of bagging
bagging, for binary classification
- inputs / Bagging
- output / Bagging
- method / Bagging
Banknote Authentication data set
- URL / Predicting the authenticity of banknotes
batch machine learning model / Real-time and batch machine learning models
Baum-Welch algorithm
- about / Hidden Markov models
Bayesian Information Criterion (BIC)
- about / Comparing different regression models
Bayesian networks
- defining / Bayesian networks
Bayesian probability
- about / Learning from data
Bayes' Theorem
- defining / Bayes' Theorem
bias
- about / The biological neuron
Big Data
- handling, in R / R and Big Data
- about / R and Big Data
binary classification models
- assessing / Assessing binary classification models
biological neuron
- about / The biological neuron
boosting
- about / Boosting, Limitations of boosting
- AdaBoost / AdaBoost
- limitations / Limitations of boosting
bootstrapped samples
- about / Margins and out-of-bag observations
bootstrapping
- about / Bagging
bootstrap resampling
- about / Bagging
bootstrap sampling
- about / Bagging
Box-Cox transformation / Feature transformations
Brownian Motion
- about / Random walk

C

D

E

Ecotect / Predicting the energy efficiency of buildings
emitted symbol
- about / Hidden Markov models
energy efficiency, of buildings
- predicting / Predicting the energy efficiency of buildings
Energy Efficiency data set
- URL / Predicting the energy efficiency of buildings
entropy
- about / C5.0
expectation
- about / Estimating the regression coefficients
Expectation Maximization (EM) algorithm
- about / Fitting an LDA model
exploratory data analysis / Exploratory data analysis
exponential smoothing
- about / Other time series models
extensions, binary logistic classifier
- about / Extensions of the binary logistic classifier
- multinomial logistic regression / Multinomial logistic regression
- ordinal logistic regression / Ordinal logistic regression

F

G

H

I

ID3
- about / C5.0
Independence of Irrelevant Alternatives (IIA)
- about / Multinomial logistic regression
independent and identically distributed (iid)
- about / Margins and out-of-bag observations, White noise
information statistic
- about / C5.0
inhibitors
- about / The biological neuron
inner products / Inner products
intense earthquakes
- predicting / Predicting intense earthquakes
intercept
- about / Introduction to linear regression
interquartile range
- about / Residual analysis
invertible
- about / Moving average models
item-based collaborative filtering
- about / Item-based collaborative filtering

J

J48
- about / C5.0

K

L

M

N

O

P

Q

Q-Q plots
- about / Residual analysis
QSAR biodegradation
- URL / Predicting chemical biodegradation
Quantile-Quantile plot (Q-Q plot)
- about / Residual analysis

R

S

T

U

V

W

Z

Z-score normalization / Feature transformations

Mastering Predictive Analytics with R

By: Rui Miguel Forte
