Book Image

Machine Learning with Go Quick Start Guide

By : Michael Bironneau, Toby Coleman
Book Image

Machine Learning with Go Quick Start Guide

By: Michael Bironneau, Toby Coleman

Overview of this book

Machine learning is an essential part of today's data-driven world and is extensively used across industries, including financial forecasting, robotics, and web technology. This book will teach you how to efficiently develop machine learning applications in Go. The book starts with an introduction to machine learning and its development process, explaining the types of problems that it aims to solve and the solutions it offers. It then covers setting up a frictionless Go development environment, including running Go interactively with Jupyter notebooks. Finally, common data processing techniques are introduced. The book then teaches the reader about supervised and unsupervised learning techniques through worked examples that include the implementation of evaluation metrics. These worked examples make use of the prominent open-source libraries GoML and Gonum. The book also teaches readers how to load a pre-trained model and use it to make predictions. It then moves on to the operational side of running machine learning applications: deployment, Continuous Integration, and helpful advice for effective logging and monitoring. At the end of the book, readers will learn how to set up a machine learning project for success, formulating realistic success criteria and accurately translating business requirements into technical ones.
Table of Contents (9 chapters)

Principal component analysis

Principal component analysis (PCA) is a way to reduce the number of dimensions in a dataset. We can think of it as a way of compressing a dataset. Suppose you have 100 different variables in your dataset. It may be the case that many of these variables are correlated with each other. If this is the case, then it is possible to explain most of the variation in the data by combining variables to build a smaller set of data. PCA performs this task: it tries to find linear combinations of your input variables, and reports how much variation is explained by each combination.

PCA is a method for reducing the dimensions in a dataset: in effect, summarizing it so that you can focus on the most important features, which explain most of the variation in the dataset.

PCA can be useful for machine learning in two ways:

  • It can be a useful preprocessing step before...