Scala for Machine Learning

Book Image

Scala for Machine Learning

By : Patrick R. Nicolas

Book Image

Scala for Machine Learning

By: Patrick R. Nicolas

Overview of this book

Scala for Machine Learning

Scala for Machine Learning

Credits

About the Author

About the Author

About the Reviewers

About the Reviewers

www.PacktPub.com

www.PacktPub.com

Preface

Free Chapter

Getting Started

Getting Started

Mathematical notation for the curious

Why machine learning?

Model categorization

Taxonomy of machine learning algorithms

Don't reinvent the wheel!

Tools and frameworks

Let's kick the tires

Hello World!

Defining a methodology

Monadic data transformation

A workflow computational model

Assessing a model

Data Preprocessing

Data Preprocessing

Time series in Scala

Moving averages

Fourier analysis

The discrete Kalman filter

Alternative preprocessing techniques

Unsupervised Learning

Unsupervised Learning

Dimension reduction

Performance considerations

Naïve Bayes Classifiers

Naïve Bayes Classifiers

Probabilistic graphical models

Naïve Bayes classifiers

The Multivariate Bernoulli classification

Naïve Bayes and text mining

Regression and Regularization

Regression and Regularization

Linear regression

Numerical optimization

Logistic regression

Sequential Data Models

Sequential Data Models

Markov decision processes

The hidden Markov model

Conditional random fields

Regularized CRFs and text analytics

Comparing CRF and HMM

Performance consideration

Kernel Models and Support Vector Machines

Kernel Models and Support Vector Machines

Kernel functions

Support vector machines

Support vector classifiers – SVC

Anomaly detection with one-class SVC

Support vector regression

Performance considerations

Artificial Neural Networks

Artificial Neural Networks

Feed-forward neural networks

The multilayer perceptron

Convolution neural networks

Benefits and limitations

Genetic Algorithms

Genetic Algorithms

Genetic algorithms and machine learning

Genetic algorithm components

GA for trading strategies

Advantages and risks of genetic algorithms

Reinforcement Learning

Reinforcement Learning

Reinforcement learning

Learning classifier systems

Scalable Frameworks

Scalable Frameworks

Scalability with Actors

Basic Concepts

Scala programming

Suggested online courses

Index

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

The Multivariate Bernoulli classification

The previous example uses the Gaussian distribution for features that are essentially binary (UP = 1 and DOWN = 0) to represent the change in value. The mean value is computed as the ratio of the number of observations for which x_i = UP over the total number of observations.

As stated in the first section, the Gaussian distribution is more appropriate for either continuous features or binary features for very large labeled datasets. The example is the perfect candidate for the Bernoulli model.

Model

The Bernoulli model differs from the Naïve Bayes classifier in such a way that it penalizes the feature x that does not have any observation; the Naïve Bayes classifier ignores it [5:10].

Note

The Bernoulli mixture model

M8: For a feature function f_k with f_k = 1, if the feature is observed, and a value of 0 otherwise, and the probability p of the observed feature x_k belongs to the class C_j, then the posterior probability is computed as follows:

Implementation...