Learn Amazon SageMaker

By : Julien Simon

Learn Amazon SageMaker

By: Julien Simon

Overview of this book

Amazon SageMaker enables you to quickly build, train, and deploy machine learning (ML) models at scale, without managing any infrastructure. It helps you focus on the ML problem at hand and deploy high-quality models by removing the heavy lifting typically involved in each step of the ML process. This book is a comprehensive guide for data scientists and ML developers who want to learn the ins and outs of Amazon SageMaker. You’ll understand how to use various modules of SageMaker as a single toolset to solve the challenges faced in ML. As you progress, you’ll cover features such as AutoML, built-in algorithms and frameworks, and the option for writing your own code and algorithms to build ML models. Later, the book will show you how to integrate Amazon SageMaker with popular deep learning libraries such as TensorFlow and PyTorch to increase the capabilities of existing models. You’ll also learn to get the models to production faster with minimum effort and at a lower cost. Finally, you’ll explore how to use Amazon SageMaker Debugger to analyze, detect, and highlight problems to understand the current model state and improve model accuracy. By the end of this Amazon book, you’ll be able to use Amazon SageMaker on the full spectrum of ML workflows, from experimentation, training, and monitoring to scaling, deployment, and automation.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Reviews

Section 1: Introduction to Amazon SageMaker

Free Chapter

Chapter 1: Introduction to Amazon SageMaker

Technical requirements

Exploring the capabilities of Amazon SageMaker

Demonstrating the strengths of Amazon SageMaker

Setting up Amazon SageMaker on your local machine

Setting up an Amazon SageMaker notebook instance

Setting up Amazon SageMaker Studio

Summary

Chapter 2: Handling Data Preparation Techniques

Technical requirements

Discovering Amazon SageMaker Ground Truth

Exploring Amazon SageMaker Processing

Processing data with other AWS services

Summary

Section 2: Building and Training Models

Chapter 3: AutoML with Amazon SageMaker Autopilot

Technical requirements

Discovering Amazon SageMaker Autopilot

Using SageMaker Autopilot in SageMaker Studio

Using the SageMaker Autopilot SDK

Diving deep on SageMaker Autopilot

Summary

Chapter 4: Training Machine Learning Models

Technical requirements

Discovering the built-in algorithms in Amazon SageMaker

Training and deploying models with built-in algorithms

Using the SageMaker SDK with built-in algorithms

Working with more built-in algorithms

Summary

Chapter 5: Training Computer Vision Models

Technical requirements

Discovering the CV built-in algorithms in Amazon SageMaker

Preparing image datasets

Using the built-in CV algorithms

Summary

Chapter 6: Training Natural Language Processing Models

Technical requirements

Discovering the NLP built-in algorithms in Amazon SageMaker

Preparing natural language datasets

Using the built-in algorithms for NLP

Summary

Chapter 7: Extending Machine Learning Services Using Built-In Frameworks

Technical requirements

Discovering the built-in frameworks in Amazon SageMaker

Running your framework code on Amazon SageMaker

Using the built-in frameworks

Summary

Chapter 8: Using Your Algorithms and Code

Technical requirements

Understanding how SageMaker invokes your code

Using the SageMaker training toolkit with scikit-learn

Building a fully custom container for scikit-learn

Building a fully custom container for R

Training and deploying with XGBoost and MLflow

Training and deploying with XGBoost and Sagify

Summary

Section 3: Diving Deeper on Training

Chapter 9: Scaling Your Training Jobs

Technical requirements

Understanding when and how to scale

Streaming datasets with pipe mode

Using other storage services

Distributing training jobs

Training an Image Classification model on ImageNet

Summary

Chapter 10: Advanced Training Techniques

Technical requirements

Optimizing training costs with Managed Spot Training

Optimizing hyperparameters with Automatic Model Tuning

Exploring models with SageMaker Debugger

Summary

Section 4: Managing Models in Production

Chapter 11: Deploying Machine Learning Models

Technical requirements

Examining model artifacts

Managing real-time endpoints

Deploying batch transformers

Deploying inference pipelines

Monitoring predictions with Amazon SageMaker Model Monitor

Deploying models to container services

Summary

Chapter 12: Automating Machine Learning Workflows

Technical requirements

Automating with AWS CloudFormation

Automating with the AWS Cloud Development Kit

Automating with AWS Step Functions

Summary

Chapter 13: Optimizing Prediction Cost and Performance

Technical requirements

Autoscaling an endpoint

Deploying a multi-model endpoint

Deploying a model with Amazon Elastic Inference

Compiling models with Amazon SageMaker Neo

Building a cost optimization checklist

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Exploring the capabilities of Amazon SageMaker

Amazon SageMaker was launched at AWS re:Invent 2017. Since then, a lot of new features have been added: you can see the full (and ever-growing) list at https://aws.amazon.com/about-aws/whats-new/machine-learning.

In this section, you'll learn about the main capabilities of Amazon SageMaker and their purpose. Don't worry, we'll dive deep on each of them in later chapters. We will also talk about the SageMaker Application Programming Interfaces (APIs), and the Software Development Kits (SDKs) that implement them.

The main capabilities of Amazon SageMaker

At the core of Amazon SageMaker is the ability to build, train, optimize, and deploy models on fully managed infrastructure, and at any scale. This lets you focus on studying and solving the ML problem at hand, instead of spending time and resources on building and managing infrastructure. Simply put, you can go from building to training to deploying more quickly. Let's zoom in on each step and highlight relevant SageMaker capabilities.

Building

Amazon SageMaker provides you with two development environments:

Notebook instances: Fully managed Amazon EC2 instances that come preinstalled with the most popular tools and libraries: Jupyter, Anaconda, and so on.
Amazon SageMaker Studio: A full-fledged integrated development environment for ML projects.

When it comes to experimenting with algorithms, you can choose from the following:

A collection of 17 built-in algorithms for ML and deep learning, already implemented and optimized to run efficiently on AWS. No ML code to write!
A collection of built-in open source frameworks (TensorFlow, PyTorch, Apache MXNet, scikit-learn, and more), where you simply bring your own code.
Your own code running in your own container: custom Python, R, C++, Java, and so on.
Algorithms and pretrained models from AWS Marketplace for ML (https://aws.amazon.com/marketplace/solutions/machine-learning).

In addition, Amazon SageMaker Autopilot uses AutoML to automatically build, train, and optimize models without the need to write a single line of ML code.

Amazon SageMaker also includes two major capabilities that help with building and preparing datasets:

Amazon SageMaker Ground Truth: Annotate datasets at any scale. Workflows for popular use cases are built in (image detection, entity extraction, and more), and you can implement your own. Annotation jobs can be distributed to workers that belong to private, third-party, or public workforces.
Amazon SageMaker Processing: Run data processing and model evaluation batch jobs, using either scikit-learn or Spark.

Training

As mentioned earlier, Amazon SageMaker takes care of provisioning and managing your training infrastructure. You'll never spend any time managing servers, and you'll be able to focus on ML. On top of this, SageMaker brings advanced capabilities such as the following:

Managed storage using either Amazon S3, Amazon EFS, or Amazon FSx for Lustre depending on your performance requirements.
Managed spot training, using Amazon EC2 Spot instances for training in order to reduce costs by up to 80%.
Distributed training automatically distributes large-scale training jobs on a cluster of managed instances
Pipe mode streams infinitely large datasets from Amazon S3 to the training instances, saving the need to copy data around.
Automatic model tuning runs hyperparameter optimization in order to deliver high-accuracy models more quickly.
Amazon SageMaker Experiments easily tracks, organizes, and compares all your SageMaker jobs.
Amazon SageMaker Debugger captures the internal model state during training, inspects it to observe how the model learns, and detects unwanted conditions that hurt accuracy.

Deploying

Just as with training, Amazon SageMaker takes care of all your deployment infrastructure, and brings a slew of additional features:

Real-time endpoints: This creates an HTTPS API that serves predictions from your model. As you would expect, autoscaling is available.
Batch transform: This uses a model to predict data in batch mode.
Infrastructure monitoring with Amazon CloudWatch: This helps you to view real-time metrics and keep track of infrastructure performance.
Amazon SageMaker Model Monitor: This captures data sent to an endpoint, and compares it with a baseline to identify and alert on data quality issues (missing features, data drift, and more).
Amazon SageMaker Neo: This compiles models for a specific hardware architecture, including embedded platforms, and deploys an optimized version using a lightweight runtime.
Amazon Elastic Inference: This adds fractional GPU acceleration to CPU-based instances in order to find the best cost/performance ratio for your prediction infrastructure.

The Amazon SageMaker API

Just like all other AWS services, Amazon SageMaker is driven by APIs that are implemented in the language SDKs supported by AWS (https://aws.amazon.com/tools/). In addition, a dedicated Python SDK, aka the 'SageMaker SDK,' is also available. Let's look at both, and discuss their respective benefits.

The AWS language SDKs

Language SDKs implement service-specific APIs for all AWS services: S3, EC2, and so on. Of course, they also include SageMaker APIs, which are documented at https://docs.aws.amazon.com/sagemaker/latest/dg/api-and-sdk-reference.html.

When it comes to data science and ML, Python is the most popular language, so let's take a look at the SageMaker APIs available in boto3, the AWS SDK for the Python language (https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/sagemaker.html). These APIs are quite low level and verbose: for example, create_training_job() has a lot of JSON parameters that don't look very obvious. You can see some of them in the next screenshot. You may think that this doesn't look very appealing for everyday ML experimentation… and I would totally agree!

Figure 1.1 A partial view of the create_training_job() API in boto3

Indeed, these service-level APIs are not meant to be used for experimentation in notebooks. Their purpose is automation, through either bespoke scripts or Infrastructure-as-Code tools such as AWS CloudFormation (https://aws.amazon.com/cloudformation) and Terraform (https://terraform.io). Your DevOps team will use them to manage production, where they do need full control over each possible parameter.

So, what should you use for experimentation? You should use the Amazon SageMaker SDK.

The Amazon SageMaker SDK

The Amazon SageMaker SDK (https://github.com/aws/sagemaker-python-sdk) is a Python SDK specific to Amazon SageMaker. You can find its documentation at https://sagemaker.readthedocs.io/en/stable/.

Note:

The code examples in this book are based on the first release of the SageMaker SDK v2, released in August 2020. For the sake of completeness, and to help you migrate your own notebooks, the companion GitHub repository includes examples for SDK v1 and v2.

Here, the abstraction level is much higher: the SDK contains objects for models, estimators, models, predictors, and so on. We're definitely back into ML territory.

For instance, this SDK makes it extremely easy and comfortable to fire up a training job (one line of code) and to deploy a model (one line of code). Infrastructure concerns are abstracted away, and we can focus on ML instead. Here's an example. Don't worry about the details for now:

# Configure the training job my_estimator = TensorFlow(    'my_script.py',    role=my_sageMaker_role,    instance_type='ml.p3.2xlarge',    instance_count=1,    framework_version='2.1.0')
# Train the model my_estimator.fit('s3://my_bucket/my_training_data/')
# Deploy the model to an HTTPS endpoint my_predictor = my_estimator.deploy(    initial_instance_count=1,     instance_type='ml.c5.2xlarge')

Now that we know a little more about Amazon SageMaker, let's see how it helps typical customers make their ML workflows more agile and more efficient.

Learn Amazon SageMaker

By : Julien Simon

Learn Amazon SageMaker

By: Julien Simon

Overview of this book

Related Content you might be interested in

Current Title:

Learn Amazon SageMaker

Machine Learning with Amazon SageMaker Cookbook

Accelerate Deep Learning Workloads with Amazon SageMaker

Getting Started with Amazon SageMaker Studio

Exploring the capabilities of Amazon SageMaker

The main capabilities of Amazon SageMaker

Building

Training

Deploying

The Amazon SageMaker API

The AWS language SDKs

The Amazon SageMaker SDK