Scala for Machine Learning

By: Patrick R. Nicolas

Regularization

The ordinary least squares method for finding the regression parameters is a special case of maximum likelihood estimation. Regression models are therefore exposed to the same risk of overfitting as any other discriminative model. As stated in the Overfitting section in Chapter 2, Hello World!, regularization is used to reduce model complexity and avoid overfitting.

L_n roughness penalty

Regularization consists of adding a J(w) penalty function to the loss function (or RSS in the case of a regressive classifier) in order to prevent the model parameters (also known as weights) from reaching high values. A model that fits a training set very well tends to have many feature variables with relatively large weights. Constraining the weights in this way is known as shrinkage. Practically, shrinkage involves adding a function with model parameters as an argument to the loss function (M5):
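The formula labeled M5 is not reproduced in this extract. One form consistent with the description above, where L(w) is the loss function and λ is an assumed name for the factor that weights the penalty, would be:

```latex
\hat{w} = \arg\min_{w} \left\{ L(w) + \lambda \, J(w) \right\}
```

As a minimal sketch of the same idea, the following Scala code adds a penalty to the RSS of a linear model. The names `PenalizedLoss`, `rss`, `l2Penalty`, `penalizedLoss`, and `lambda` are hypothetical and are not taken from the book's library, and the squared L2 norm is only one assumed choice for J(w).

```scala
// Illustrative sketch only: all names here are hypothetical, not the book's API.
object PenalizedLoss {
  type Weights = Vector[Double]
  type Features = Vector[Double]

  // Linear prediction without an intercept: the dot product of w and x.
  def predict(w: Weights, x: Features): Double =
    w.zip(x).map { case (wj, xj) => wj * xj }.sum

  // Residual sum of squares over the training set (xs, ys).
  def rss(w: Weights, xs: Seq[Features], ys: Seq[Double]): Double =
    xs.zip(ys).map { case (x, y) =>
      val e = y - predict(w, x)
      e * e
    }.sum

  // Assumed penalty J(w): the squared L2 norm of the weights.
  // It takes only the weights as an argument, never the observations.
  def l2Penalty(w: Weights): Double = w.map(wj => wj * wj).sum

  // Penalized loss: RSS + lambda * J(w).
  def penalizedLoss(
      w: Weights,
      xs: Seq[Features],
      ys: Seq[Double],
      lambda: Double): Double =
    rss(w, xs, ys) + lambda * l2Penalty(w)
}
```

With `lambda` set to 0 the penalized loss reduces to the plain RSS. Larger values of `lambda` make large weights more costly, which is the shrinkage effect described above.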

The penalty function is completely independent of the training set {x...