Book Image

Machine Learning with Amazon SageMaker Cookbook

By : Joshua Arvin Lat
Book Image

Machine Learning with Amazon SageMaker Cookbook

By: Joshua Arvin Lat

Overview of this book

Amazon SageMaker is a fully managed machine learning (ML) service that helps data scientists and ML practitioners manage ML experiments. In this book, you'll use the different capabilities and features of Amazon SageMaker to solve relevant data science and ML problems. This step-by-step guide features 80 proven recipes designed to give you the hands-on machine learning experience needed to contribute to real-world experiments and projects. You'll cover the algorithms and techniques that are commonly used when training and deploying NLP, time series forecasting, and computer vision models to solve ML problems. You'll explore various solutions for working with deep learning libraries and frameworks such as TensorFlow, PyTorch, and Hugging Face Transformers in Amazon SageMaker. You'll also learn how to use SageMaker Clarify, SageMaker Model Monitor, SageMaker Debugger, and SageMaker Experiments to debug, manage, and monitor multiple ML experiments and deployments. Moreover, you'll have a better understanding of how SageMaker Feature Store, Autopilot, and Pipelines can meet the specific needs of data science teams. By the end of this book, you'll be able to combine the different solutions you've learned as building blocks to solve real-world ML problems.
Table of Contents (11 chapters)

Chapter 5: Effectively Managing Machine Learning Experiments

In the previous chapter, we worked on several recipes that focused on preparing and processing the data before passing it as input to the training jobs. In this chapter, we will focus on different solutions and capabilities to help us manage machine learning (ML) experiments in Amazon SageMaker.

Once we have performed a certain number of ML experiments, we will realize that not all experiments succeed, and it takes a bit of trial and error to build high-quality ML models. This is somewhat similar to software development where bugs in the code need to be detected as early as possible to prevent these bugs from accidentally getting deployed into a production environment. Debugging ML experiments is generally much harder compared to debugging issues in software code since we would need a specialized tool that inspects and monitors changes in the values of parameters, metrics, and other variables in the experiment and performs...