Preparing, building, training and tuning, deploying, and managing ML models

Discussion of data preparation capabilities

Feature tour of model-building capabilities

Feature tour of training and tuning capabilities

Feature tour of model management and deployment capabilities

Chapter 2: Data Science Environments

Machine learning use case and dataset

Creating data science environment

Chapter 3: Data Labeling with Amazon SageMaker Ground Truth

Challenges with labeling data at scale

Addressing unique labeling requirements with custom labeling workflows

Improving labeling quality using multiple workers

Using active learning to reduce labeling time

Security and permissions

Chapter 4: Data Preparation at Scale Using Amazon SageMaker Data Wrangler and Processing

Visual data preparation with Data Wrangler

Bias detection and explainability with Data Wrangler and Clarify

Data preparation at scale with SageMaker Processing

Chapter 5: Centralized Feature Repository with Amazon SageMaker Feature Store

Amazon SageMaker Feature Store essentials

Creating feature groups

Populating feature groups

Retrieving features from feature groups

Creating reusable features to reduce feature inconsistencies and inference latency

Designing solutions for near real-time ML predictions

Section 2: Model Training Challenges

Chapter 6: Training and Tuning at Scale

ML training at scale with SageMaker distributed libraries

Automated model tuning with SageMaker hyperparameter tuning

Organizing and tracking training jobs with SageMaker Experiments

Chapter 7: Profile Training Jobs with Amazon SageMaker Debugger

Amazon SageMaker Debugger essentials

Real-time monitoring of training jobs using built-in and custom rules

Gaining insight into the training infrastructure and training framework

Further reading

Section 3: Manage and Monitor Models

Chapter 8: Managing Models at Scale Using a Model Registry

Choosing a model registry solution

Using a model registry

Managing models using the Amazon SageMaker model registry

Chapter 9: Updating Production Models Using Amazon SageMaker Endpoint Production Variants

Basic concepts of Amazon SageMaker Endpoint Production Variants

Deployment strategies for updating ML models with SageMaker Endpoint Production Variants

Selecting an appropriate deployment strategy

Chapter 10: Optimizing Model Hosting and Inference Costs

Real-time inference versus batch inference

Deploying multiple models behind a single inference endpoint

Scaling inference endpoints to meet inference traffic demands

Using Elastic Inference for deep learning models

Optimizing models with SageMaker Neo

Chapter 11: Monitoring Production Models with Amazon SageMaker Model Monitor and Clarify

Basic concepts of Amazon SageMaker Model Monitor and Amazon SageMaker Clarify

End-to-end architectures for monitoring ML models

Best practices for monitoring ML models

Section 4: Automate and Operationalize Machine Learning

Chapter 12: Machine Learning Automated Workflows

Considerations for automating your SageMaker ML workflows

Building ML workflows with Amazon SageMaker Pipelines

Creating CI/CD pipelines using Amazon SageMaker Projects

Best practices for operationalizing ML workloads

Chapter 13: Well-Architected Machine Learning with Amazon SageMaker

Best practices for securing ML workloads

Best practices for reliable ML workloads

Best practices for building performant ML workloads

Best practices for cost-optimized ML workloads

Chapter 14: Managing SageMaker Features across Accounts

Examining an overview of the AWS multi-account environment

Understanding the benefits of using multiple AWS accounts with Amazon SageMaker

Examining multi-account considerations with Amazon SageMaker