Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying Machine Learning Engineering on AWS
  • Table Of Contents Toc
Machine Learning Engineering on AWS

Machine Learning Engineering on AWS - Second Edition

By : Joshua Arvin Lat
close
close
Machine Learning Engineering on AWS

Machine Learning Engineering on AWS

By: Joshua Arvin Lat

Overview of this book

Modern AI systems increasingly leverage large language models, retrieval-augmented generation, and AI agents to power generative AI applications in the cloud. As organizations operationalize these systems at scale, there is a growing need for engineers with strong machine learning engineering expertise. To stay ahead in this rapidly evolving field, you need a deep understanding of AI and ML concepts as well as, practical, hands-on experience with the platforms and tools used to build and operate production-grade AI systems. Machine Learning Engineering on AWS is a practical guide that shows you how to use AWS services such as Amazon Bedrock and Amazon SageMaker AI to fine-tune, evaluate, and deploy LLMs and generative AI systems. You'll learn how to develop RAG-powered systems, build and deploy AI agents using Bedrock AgentCore and Strands Agents, evaluate models using LLM-as-a-judge techniques, and automate LLMOps pipelines using SageMaker Pipelines. The book also covers best practices for building scalable, secure, and production-ready GenAI systems. AWS AI hero Joshua Arvin Lat equips you with the skills and practical knowledge to handle a wide variety of ML engineering requirements, helping you design, operationalize, and secure generative AI systems and AI agents on AWS with confidence. *Email sign-up and proof of purchase required"
Table of Contents (12 chapters)
close
close
10
Other Books You May Enjoy
11
Index

8

SageMaker AI Model Deployment Options and Strategies

Imagine you've spent days or weeks training an ML model that can solve a real problem. As you probably know already, if it just sits in your experimentation environment, nobody else can use it. To make it truly useful, you need to deploy your model so it can handle requests, process data, and deliver predictions in real time. In the past, you would have had to write custom code to serve your model and set up the entire serving infrastructure. You would also need to manage scaling, load balancing, and reliability manually, which could take days or even weeks. This made deploying models time-consuming and error-prone. SageMaker AI simplifies this process by managing the infrastructure and offering a range of deployment options, so you can focus on putting your model into action and realizing its value.

In this chapter, we'll explore various options and strategies for deploying models in SageMaker AI. You will work with an...

CONTINUE READING
83
Tech Concepts
36
Programming languages
73
Tech Tools
Icon Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.
Icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Icon 50+ new titles added per month and exclusive early access to books as they are being written.
Machine Learning Engineering on AWS
notes
bookmark Notes and Bookmarks search Search in title playlist Add to playlist download Download options font-size Font size

Change the font size

margin-width Margin width

Change margin width

day-mode Day/Sepia/Night Modes

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY

Submit Your Feedback

Modal Close icon
Modal Close icon
Modal Close icon