Chapter 8: SageMaker AI Model Deployment Options and Strategies

Machine Learning Engineering on AWS - Second Edition

By: Joshua Arvin Lat

Overview of this book

Modern AI systems increasingly leverage large language models, retrieval-augmented generation, and AI agents to power generative AI applications in the cloud. As organizations operationalize these systems at scale, there is a growing need for engineers with strong machine learning engineering expertise. To stay ahead in this rapidly evolving field, you need a deep understanding of AI and ML concepts as well as practical, hands-on experience with the platforms and tools used to build and operate production-grade AI systems. Machine Learning Engineering on AWS is a practical guide that shows you how to use AWS services such as Amazon Bedrock and Amazon SageMaker AI to fine-tune, evaluate, and deploy LLMs and generative AI systems. You'll learn how to develop RAG-powered systems, build and deploy AI agents using Bedrock AgentCore and Strands Agents, evaluate models using LLM-as-a-judge techniques, and automate LLMOps pipelines using SageMaker Pipelines. The book also covers best practices for building scalable, secure, and production-ready GenAI systems. AWS AI hero Joshua Arvin Lat equips you with the skills and practical knowledge to handle a wide variety of ML engineering requirements, helping you design, operationalize, and secure generative AI systems and AI agents on AWS with confidence.

Preface

Free benefits with your book

Chapter 1: A Gentle Introduction to Generative AI and AI Agents on AWS

Technical requirements

Generative AI for the modern machine learning engineer

Exploring foundation models in Amazon Bedrock

Setting up and configuring your SageMaker Studio environment

Configuring IAM permissions for your SageMaker Studio space

Introduction to AI agents with Amazon Bedrock and Strands Agents

Summary

Further reading

Chapter 2: Building AI Agents with SageMaker AI and Bedrock AgentCore

Technical requirements

Deploying a pretrained LLM with SageMaker AI

Building AI agents with Amazon SageMaker AI and Strands Agents

Building AI agents with Amazon Bedrock AgentCore

Deploying production-ready agents with Bedrock AgentCore Runtime

Setting up an Amazon Bedrock Knowledge Base

Building a RAG-powered AI agent with Strands Agents

Building a RAG-powered AI agent that interacts with a SageMaker AI inference endpoint

Summary

Further reading

Chapter 3: Machine Learning Engineering with Amazon SageMaker AI

Technical requirements

Setting up and preparing your JupyterLab notebook

Preparing a synthetic dataset for binary classification

Training an XGBoost binary classifier

Deploying an XGBoost model to a real-time inference endpoint

Setting up BERT fine-tuning with SageMaker JumpStart

Using a smaller dataset for fine-tuning

Running the BERT model fine-tuning job

Deploying the fine-tuned model to a real-time inference endpoint

Summary

Further reading

Chapter 4: Modernizing Analytics with a Managed Transactional Data Lake

Technical requirements

Preparing and processing the synthetic data

Creating an Amazon S3 table bucket

Launching an Amazon EMR cluster with Apache Iceberg installed

Performing Apache Iceberg queries on S3 tables with Apache Spark

Performing time travel queries on S3 tables

Summary

Further reading

Chapter 5: Practical Data Management on AWS

Technical requirements

Working with AWS Lake Formation permissions

Running SQL queries in Amazon Athena

Ingesting data into a SageMaker feature store

Adding searchable metadata to features

Retrieving data from the online and offline feature stores

Summary

Further reading

Chapter 6: Pragmatic Data Processing on AWS

Technical requirements

Getting started with SageMaker Processing jobs

Running your first SageMaker Processing job

Preparing the input data and processing script for the back translation job

Automating back translation workflows with SageMaker Processing jobs

Summary

Further reading

Chapter 7: SageMaker AI Model Training and Tuning Capabilities

Technical requirements

Setting up a serverless MLflow app

Fine-tuning an LLM on Amazon SageMaker AI

Deploying the Fine-Tuned Model

Performing Hyperparameter Tuning with Amazon SageMaker AI

Deploying the Best-Performing Model from Hyperparameter Tuning

Summary

Further reading

Chapter 8: SageMaker AI Model Deployment Options and Strategies

Technical requirements

Preparing your JupyterLab notebook for model deployment

Deploying your model to a real-time inference endpoint

Deploying your model to a serverless inference endpoint

Running batch inference with batch transform

Deploying your model to an asynchronous inference endpoint

Setting up a shadow test with a SageMaker inference endpoint

Using canary traffic shifting when performing Blue/Green deployments

Summary

Further reading

Chapter 9: Automating LLMOps Workflows with SageMaker Pipelines

Technical requirements

Setting up the project environment and dependencies

Building and running the Single-Step Fine-Tuning pipeline

Building and running the Single-Step evaluation pipeline

Configuring and running a Two-Step Fine-Tuning and evaluation pipeline

Preparing the Lambda functions for deployment of a model to an endpoint

Completing the LLMOps pipeline

Best practices and key considerations for building automated ML workflows

Summary

Further reading

Other Books You May Enjoy

Index
