Sign In Start Free Trial

Book Overview & Buying
Table Of Contents

Scala for Machine Learning

By : Patrick R. Nicolas

3.8 (12)

Scala for Machine Learning

3.8 (12)

By: Patrick R. Nicolas

Overview of this book

Are you curious about AI? All you need is a good understanding of the Scala programming language, a basic knowledge of statistics, a keen interest in Big Data processing, and this book!

Preface

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Free Chapter

1. Getting Started

1. Getting Started

Mathematical notation for the curious

Why machine learning?

Why Scala?

Model categorization

Taxonomy of machine learning algorithms

Tools and frameworks

Source code

Let's kick the tires

Summary

2. Hello World!

2. Hello World!

Modeling

Designing a workflow

Assessing a model

Summary

3. Data Preprocessing

3. Data Preprocessing

Time series

Moving averages

Fourier analysis

The Kalman filter

Alternative preprocessing techniques

Summary

4. Unsupervised Learning

4. Unsupervised Learning

Clustering

Dimension reduction

Performance considerations

Summary

5. Naïve Bayes Classifiers

5. Naïve Bayes Classifiers

Probabilistic graphical models

Naïve Bayes classifiers

Multivariate Bernoulli classification

Naïve Bayes and text mining

Pros and cons

Summary

6. Regression and Regularization

6. Regression and Regularization

Linear regression

Regularization

Numerical optimization

The logistic regression

Summary

7. Sequential Data Models

7. Sequential Data Models

Markov decision processes

The hidden Markov model (HMM)

Conditional random fields

CRF and text analytics

Comparing CRF and HMM

Performance consideration

Summary

8. Kernel Models and Support Vector Machines

8. Kernel Models and Support Vector Machines

Kernel functions

The support vector machine (SVM)

Support vector classifier (SVC)

Anomaly detection with one-class SVC

Support vector regression (SVR)

Performance considerations

Summary

9. Artificial Neural Networks

9. Artificial Neural Networks

Feed-forward neural networks (FFNN)

The multilayer perceptron (MLP)

Evaluation

Benefits and limitations

Summary

10. Genetic Algorithms

10. Genetic Algorithms

Evolution

Genetic algorithms and machine learning

Genetic algorithm components

Implementation

GA for trading strategies

Advantages and risks of genetic algorithms

Summary

11. Reinforcement Learning

11. Reinforcement Learning

Introduction

Learning classifier systems

Summary

12. Scalable Frameworks

12. Scalable Frameworks

Overview

Scala

Scalability with Actors

Akka

Apache Spark

Summary

A. Basic Concepts

A. Basic Concepts

Scala programming

Mathematics

Finances 101

Suggested online courses

References

Index

Index

Clustering

Problems involving a large number of features for large datasets become quickly intractable, and it is quite difficult to evaluate the independence between features. Any computation that requires some level of optimization and, at a minimum, computation of first order derivatives requires a significant amount of computing power to manipulate high-dimension matrices. As with many engineering fields, a divide-and-conquer approach to classifying very large datasets is quite effective. The objective is to reduce continuous, infinite, or very large datasets into a small group of observations that share some common attributes.

Clustering

Visualization of data clustering

This approach is known as vector quantization. Vector quantization is a method that divides a set of observations into groups of similar size. The main benefit of vector quantization is that the analysis using a representative of each group is far simpler than an analysis of the entire dataset [4:2].

Clustering, also known as cluster...

CONTINUE READING

83

Tech Concepts

36

Programming languages

73

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

Scala for Machine Learning

Search

Your notes and bookmarks