Using NVIDIA Triton
NVIDIA Triton is an open source model server developed by NVIDIA. It supports multiple DL frameworks (such as TensorFlow, PyTorch, ONNX, Python, and OpenVINO), as well as various hardware platforms and runtime environments (NVIDIA GPUs, x86 and ARM CPUs, and AWS Inferentia). Triton can be used for inference in cloud and data center environments as well as on edge and mobile devices. Triton is optimized for performance and scalability on various CPU and GPU platforms, and NVIDIA provides specialized utilities for performance analysis and model analysis to help you tune Triton deployments.
Integration with SageMaker
You can use the Triton model server on SageMaker via a pre-built SageMaker DL container. Note that SageMaker Triton containers are not open source. You can find the latest list of Triton containers here: https://github.com/aws/deep-learning-containers/blob/master/available_images.md#nvidia-triton-inference-containers-sm-support-only.
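To make this concrete, here is a minimal sketch of deploying a model with a Triton container behind a SageMaker real-time endpoint using the SageMaker Python SDK. The image URI, S3 path, model name, and instance type are placeholders, not values from this book; take the actual Triton container URI for your region and version from the list linked above.

```python
# A minimal sketch (placeholder values, not a definitive recipe): deploying a
# Triton-served model to a SageMaker real-time endpoint with the Python SDK.
import sagemaker
from sagemaker.model import Model

session = sagemaker.Session()
role = sagemaker.get_execution_role()  # assumes execution inside SageMaker

# Placeholder: copy the region- and version-specific Triton container URI
# from the available_images.md list linked above.
triton_image_uri = "<account>.dkr.ecr.<region>.amazonaws.com/sagemaker-tritonserver:<tag>"

# The model.tar.gz is assumed to contain a Triton model repository layout,
# e.g. <model_name>/config.pbtxt and <model_name>/1/model.onnx
model = Model(
    image_uri=triton_image_uri,
    model_data="s3://<bucket>/triton/model.tar.gz",
    role=role,
    env={"SAGEMAKER_TRITON_DEFAULT_MODEL_NAME": "<model_name>"},
    sagemaker_session=session,
)

predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g4dn.xlarge",  # GPU instance; adjust to your workload
)
```

The key difference from deploying with a framework container is that the model artifact must follow Triton's model repository layout rather than the framework's native packaging.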
SageMaker doesn’...