Book Image

Scala for Machine Learning

By : Patrick R. Nicolas
Book Image

Scala for Machine Learning

By: Patrick R. Nicolas

Overview of this book

Table of Contents (20 chapters)
Scala for Machine Learning
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Monadic data transformation


The first step is to define a trait and method that describe the transformation of data by the computation units of a workflow. The data transformation is the foundation of any workflow for processing and classifying a dataset, training and validating a model, and displaying results.

There are two symbolic models used for defining a data processing or data transformation:

  • Explicit model: The developer creates a model explicitly from a set of configuration parameters. Most of deterministic algorithms and unsupervised learning techniques use an explicit model.

  • Implicit model: The developer provides a training set that is a set of labeled observations (observations with an expected outcome). A classifier extracts a model through the training set. Supervised learning techniques rely on models implicitly generated from labeled data.

Error handling

The simplest form of data transformation is morphism between the two U and V types. The data transformation enforces a contract...