Hands-On Machine Learning with ML.NET

By: Jarred Capellman

Overview of this book

Machine learning (ML) is widely used in many industries, such as science, healthcare, and research, and its popularity is only growing. In May 2018, Microsoft introduced ML.NET to help .NET developers work with ML. With this book, you'll explore how to build ML.NET applications with the various ML models available, using C# code. The book starts by giving you an overview of ML and the types of ML algorithms used, along with covering what ML.NET is and why you need it to build ML apps. You'll then explore the ML.NET framework, its components, and its APIs. The book serves as a practical guide to building smart apps using the ML.NET library. You'll gradually become well versed in implementing ML algorithms such as regression, classification, and clustering with real-world examples and datasets. Each chapter covers a practical implementation, showing you how to embed ML within .NET applications. You'll also learn to integrate TensorFlow into ML.NET applications. Later, you'll discover how to store the results of a regression-based house price prediction model in a database and display real-time predictions from that database in a web application using ASP.NET Core Blazor and SignalR. By the end of this book, you'll be able to confidently perform basic to advanced-level machine learning tasks in ML.NET.
Table of Contents (19 chapters)

Section 1: Fundamentals of Machine Learning and ML.NET
Section 2: ML.NET Models
Section 3: Real-World Integrations with ML.NET
Section 4: Extending ML.NET

The importance of learning about machine learning today

In recent years, machine learning and artificial intelligence have become an integral part of many of our lives, in use cases as diverse as finding cancer cells in an MRI and facial and object recognition during a professional basketball game. Between 2013 and 2017 alone, machine learning patents grew at a 34% compound annual growth rate, while spending on AI and machine learning is forecast to reach $57.6 billion by 2021 (https://www.forbes.com/sites/louiscolumbus/2018/02/18/roundup-of-machine-learning-forecasts-and-market-estimates-2018/#794d6f6c2225).

Despite its recent surge in popularity, the term machine learning was coined back in 1959 by Arthur Samuel, so what caused the nearly 60-year gap before widespread adoption? Perhaps the two most significant factors were the availability of hardware able to process model predictions quickly enough and the sheer amount of data being captured digitally every minute. According to a 2017 study by Domo Inc., 2.5 quintillion bytes of data were generated daily, and at that time 90% of the world's data had been created between 2015 and 2017 (https://www.domo.com/learn/data-never-sleeps-5?aid=ogsm072517_1&sf100871281=1). By 2025, an estimated 463 exabytes of data will be created daily (https://www.visualcapitalist.com/how-much-data-is-generated-each-day/), much of it coming from cars, videos, pictures, IoT devices, emails, and even devices that have not yet made the transition to the smart movement.

The growth of data over the last decade has led to questions about how a business or corporation can use that data for better sales forecasting, anticipating customers' needs, or detecting malicious bytes in a file. Traditional statistical approaches could require exponentially more staff just to keep up with current demands, let alone scale with the data being captured. Take, for instance, Google Maps. Since Google's acquisition of Waze in 2013, users of Google Maps have received extremely accurate routing suggestions based on the anonymized GPS data of its users. With this model, the more data points there are (in this case, GPS data from smartphones), the better the predictions Google can make for your travel. As we will discuss later in this chapter, quality datasets are a critical component of machine learning; in the case of Google Maps in particular, the user experience would be subpar without a proper dataset.

In addition, the speed of computer hardware, specifically specialized hardware tailored for machine learning, has also played a role. The use of Application-Specific Integrated Circuits (ASICs) has grown exponentially. One of the most popular ASICs on the market is Google's Tensor Processing Unit (TPU). Originally released in 2016, it has since gone through two iterations and provides cloud-based acceleration for machine learning tasks on Google Cloud Platform. Other cloud platforms, such as Amazon's AWS and Microsoft's Azure, also provide FPGAs (Field-Programmable Gate Arrays).

Additionally, Graphics Processing Units (GPUs) from both AMD and NVIDIA accelerate both cloud-based and local workloads, via the ROCm platform and CUDA-accelerated libraries respectively. Beyond accelerating workloads, the typical professional GPUs offered by AMD and NVIDIA provide a much higher density of processors than a traditional CPU-only approach. For instance, the AMD Radeon Instinct MI60 provides 4,096 stream processors. A stream processor is not a full-fledged x86 core, so this is not a one-to-one comparison, but the MI60's peak double-precision floating-point performance is rated at 7.373 TFLOPS, compared to roughly 2.3 TFLOPS for AMD's extremely powerful EPYC 7742 server CPU. From a cost and scalability perspective, using GPUs even in a workstation configuration can cut training time dramatically, provided the algorithms are written to take advantage of the more specialized cores that AMD and NVIDIA offer. Fortunately, ML.NET provides GPU acceleration with little additional effort.
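As a rough sketch of what that looks like in practice, ML.NET can hand the scoring of an ONNX model off to a GPU through its ApplyOnnxModel transform. The model file name and device ID below are placeholder assumptions, and the snippet presumes the Microsoft.ML.OnnxTransformer and Microsoft.ML.OnnxRuntime.Gpu NuGet packages are installed:

```csharp
using Microsoft.ML;

class GpuScoringSketch
{
    static void Main()
    {
        var mlContext = new MLContext();

        // "model.onnx" is a placeholder for a pre-trained ONNX model on disk.
        // gpuDeviceId selects the CUDA device to score on; fallbackToCpu keeps
        // the pipeline usable on machines without a supported GPU.
        var pipeline = mlContext.Transforms.ApplyOnnxModel(
            modelFile: "model.onnx",
            gpuDeviceId: 0,
            fallbackToCpu: true);

        // Fitting this estimator against an IDataView of inputs produces a
        // transformer whose inference runs on the GPU where available.
    }
}
```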

From a software engineering career perspective, with demand for machine learning skills far outpacing the supply, there has never been a better time to develop them as a software engineer. Furthermore, software engineers possess skills that traditional data scientists often do not, such as the ability to automate tasks like the model-building process rather than relying on manual scripts. Another example of where a software engineer can provide more value is in adding both unit tests and efficacy tests to the full training pipeline, as sketched below. In a large production application, having these automated tests is critical to avoiding production issues.
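To make the idea of an efficacy test concrete, here is a minimal ML.NET sketch. The HouseData class, the toy in-memory dataset, and the 0.8 R-squared threshold are all assumptions for illustration; a real pipeline would evaluate against a held-out test set and pick a threshold appropriate to the problem:

```csharp
using System;
using Microsoft.ML;

// Hypothetical input schema for illustration.
public class HouseData
{
    public float Size { get; set; }
    public float Price { get; set; }
}

public static class EfficacyTestSketch
{
    public static void Main()
    {
        var mlContext = new MLContext(seed: 0);

        // Toy in-memory dataset; a real pipeline would load training and
        // held-out test data from files or a database.
        var data = mlContext.Data.LoadFromEnumerable(new[]
        {
            new HouseData { Size = 1.1f, Price = 1.2f },
            new HouseData { Size = 1.9f, Price = 2.3f },
            new HouseData { Size = 2.8f, Price = 3.0f },
            new HouseData { Size = 3.4f, Price = 3.7f },
        });

        var pipeline = mlContext.Transforms
            .Concatenate("Features", nameof(HouseData.Size))
            .Append(mlContext.Regression.Trainers.Sdca(
                labelColumnName: nameof(HouseData.Price)));

        var model = pipeline.Fit(data);

        // Efficacy test: evaluate the trained model and fail loudly if its
        // quality drops below an agreed threshold, so a bad model never
        // reaches production. The 0.8 threshold here is illustrative.
        var metrics = mlContext.Regression.Evaluate(
            model.Transform(data),
            labelColumnName: nameof(HouseData.Price));

        if (metrics.RSquared < 0.8)
        {
            throw new InvalidOperationException(
                $"Model efficacy below threshold: R^2 = {metrics.RSquared:F3}");
        }
    }
}
```

Run as part of a build or deployment step, a check like this turns model quality into a gate in the same way a failing unit test gates a code change.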

Finally, in 2018, for the first time ever, data was considered more valuable than oil. As new industries adopt data gathering and existing industries take advantage of the data they already hold, machine learning will be intertwined with that data. Machine learning is to data what a refinery is to oil.