Book Image

Getting Started with Amazon SageMaker Studio

By : Michael Hsieh
Book Image

Getting Started with Amazon SageMaker Studio

By: Michael Hsieh

Overview of this book

Amazon SageMaker Studio is the first integrated development environment (IDE) for machine learning (ML) and is designed to integrate ML workflows: data preparation, feature engineering, statistical bias detection, automated machine learning (AutoML), training, hosting, ML explainability, monitoring, and MLOps in one environment. In this book, you'll start by exploring the features available in Amazon SageMaker Studio to analyze data, develop ML models, and productionize models to meet your goals. As you progress, you will learn how these features work together to address common challenges when building ML models in production. After that, you'll understand how to effectively scale and operationalize the ML life cycle using SageMaker Studio. By the end of this book, you'll have learned ML best practices regarding Amazon SageMaker Studio, as well as being able to improve productivity in the ML development life cycle and build and deploy models easily for your ML use cases.
Table of Contents (16 chapters)
1
Part 1 – Introduction to Machine Learning on Amazon SageMaker Studio
4
Part 2 – End-to-End Machine Learning Life Cycle with SageMaker Studio
11
Part 3 – The Production and Operation of Machine Learning with SageMaker Studio

Summary

In this chapter, we explored biases in ML and ML explainability with an adult income example. We learned that the data could contain unfair biases against a certain group or category in the dataset, which could translate into an ML model making unfair predictions. We worked through an adult income-level prediction example in SageMaker Studio to analyze and compute any bias prior to model training using SageMaker Clarify. Clarify produces metrics to quantify imbalance in the dataset that could potentially lead to unfair biases. We mitigated the imbalances using sampling and matching techniques and proceeded to train an ML model. We further analyzed the resulting ML model for potential bias in predictions using SageMaker Clarify. Finally, we reviewed how the ML model makes decisions using SageMaker Clarify and SHAP values.

In the next chapter, we will learn where to go after training an ML model in SageMaker. Hosting an ML model in the cloud is critical for most ML use cases...