Scala for Machine Learning

By: Patrick R. Nicolas

Summary

In this chapter, we established the framework for the different data processing units that will be introduced in this book. Model validation and overfitting are explored early in this book for a good reason: there is no point in building models and selecting algorithms without a methodology to evaluate their relative merits.
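As an illustration of what such a methodology measures, the following minimal sketch computes precision, recall, and the F1 score of a binary classifier from its confusion-matrix counts. The names `ValidationSketch`, `Counts`, `count`, `precision`, `recall`, and `f1` are hypothetical and are not taken from the book's library; returning 0.0 for an undefined ratio is an assumption made for simplicity.

```scala
// Illustrative sketch only: all names below are hypothetical.
object ValidationSketch {
  // Confusion-matrix counts for a binary classifier.
  final case class Counts(tp: Int, fp: Int, fn: Int, tn: Int)

  // Tally (predicted, expected) label pairs into the four counts.
  def count(pairs: Seq[(Boolean, Boolean)]): Counts =
    pairs.foldLeft(Counts(0, 0, 0, 0)) {
      case (c, (true, true))   => c.copy(tp = c.tp + 1)
      case (c, (true, false))  => c.copy(fp = c.fp + 1)
      case (c, (false, true))  => c.copy(fn = c.fn + 1)
      case (c, (false, false)) => c.copy(tn = c.tn + 1)
    }

  // Fraction of predicted positives that are actual positives.
  // Assumption: an undefined ratio (no predicted positives) is reported as 0.0.
  def precision(c: Counts): Double =
    if (c.tp + c.fp == 0) 0.0 else c.tp.toDouble / (c.tp + c.fp)

  // Fraction of actual positives that were predicted as positives.
  def recall(c: Counts): Double =
    if (c.tp + c.fn == 0) 0.0 else c.tp.toDouble / (c.tp + c.fn)

  // Harmonic mean of precision and recall.
  def f1(c: Counts): Double = {
    val p = precision(c)
    val r = recall(c)
    if (p + r == 0.0) 0.0 else 2.0 * p * r / (p + r)
  }
}
```

Comparing two models on the same validation set then reduces to comparing such scores, which is one way to make "relative merits" concrete.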

In this chapter, you were introduced to the following:

The concept of monadic transformation for implicit and explicit models
The versatility and cleanness of the Cake pattern and mixin composition in Scala as effective scaffolding tools for data processing
A robust methodology to validate machine learning models
The challenge in fitting models to both training and real-world data
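The first two points above can be sketched together. The example below is a simplified illustration under stated assumptions, not the book's implementation: `Transform`, `PreprocessingModule`, `ScoringModule`, and `Workflow` are hypothetical names, and `scala.util.Try` stands in as the monad that chains transformations which may fail.

```scala
import scala.util.Try

// Illustrative sketch only: all trait and value names are hypothetical.
object WorkflowSketch {
  // A data transformation that may fail; Try supplies map and flatMap.
  trait Transform[A, B] { def apply(a: A): Try[B] }

  // Each module declares an abstract component, in the style of the Cake pattern.
  trait PreprocessingModule {
    val preprocessor: Transform[Vector[Double], Vector[Double]]
  }
  trait ScoringModule {
    val scorer: Transform[Vector[Double], Double]
  }

  // The self-type states that a Workflow must be mixed with both modules.
  trait Workflow { self: PreprocessingModule with ScoringModule =>
    // Monadic chaining: the scorer runs only if preprocessing succeeded.
    def run(xs: Vector[Double]): Try[Double] =
      preprocessor(xs).flatMap(ys => scorer(ys))
  }

  // Concrete components are bound when the traits are mixed in.
  val workflow = new Workflow with PreprocessingModule with ScoringModule {
    // Centers the observations on their mean; fails on empty input.
    val preprocessor: Transform[Vector[Double], Vector[Double]] =
      new Transform[Vector[Double], Vector[Double]] {
        def apply(xs: Vector[Double]): Try[Vector[Double]] = Try {
          require(xs.nonEmpty, "empty input")
          val mean = xs.sum / xs.size
          xs.map(_ - mean)
        }
      }
    // Reports the largest absolute deviation from the mean.
    val scorer: Transform[Vector[Double], Double] =
      new Transform[Vector[Double], Double] {
        def apply(xs: Vector[Double]): Try[Double] =
          Try(xs.map(x => math.abs(x)).max)
      }
  }
}
```

Swapping in a different preprocessor or scorer only requires mixing in another binding of the corresponding module; the `Workflow` trait itself does not change.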

The next chapter will address the problem of overfitting by identifying outliers and reducing noise in data.