Machine Learning Engineering with Python - Second Edition

By: Andrew P. McMahon

Overview of this book

The Second Edition of Machine Learning Engineering with Python is the practical guide that MLOps and ML engineers need to build solutions to real-world problems. It will provide you with the skills you need to stay ahead in this rapidly evolving field. The book takes an examples-based approach to help you develop your skills and covers the technical concepts, implementation patterns, and development methodologies you need. You'll explore the key steps of the ML development lifecycle and create your own standardized "model factory" for training and retraining of models. You'll learn to employ concepts like CI/CD and how to detect different types of drift. Get hands-on with the latest in deployment architectures and discover methods for scaling up your solutions. This edition goes deeper in all aspects of ML engineering and MLOps, with emphasis on the latest open-source and cloud-based technologies. This includes a completely revamped approach to advanced pipelining and orchestration techniques. With a new chapter on deep learning, generative AI, and LLMOps, you will learn to use tools like LangChain, PyTorch, and Hugging Face to leverage LLMs for supercharged analysis. You will explore AI assistants like GitHub Copilot to become more productive, then dive deep into the engineering considerations of working with deep learning.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Introduction to ML Engineering

Technical requirements

Defining a taxonomy of data disciplines

Working as an effective team

ML engineering in the real world

What does an ML solution look like?

High-level ML system design

Summary

The Machine Learning Development Process

Technical requirements

Setting up our tools

Concept to solution in four steps

Summary

From Model to Model Factory

Technical requirements

Defining the model factory

Learning about learning

Engineering features for machine learning

Designing your training system

Retraining required

Persisting your models

Building the model factory with pipelines

Summary

Packaging Up

Technical requirements

Writing good Python

Choosing a style

Packaging your code

Building your package

Testing, logging, securing, and error handling

Not reinventing the wheel

Summary

Deployment Patterns and Tools

Technical requirements

Architecting systems

Exploring some standard ML patterns

Containerizing

Hosting your own microservice on AWS

Building general pipelines with Airflow

Building advanced ML pipelines

Selecting your deployment strategy

Summary

Scaling Up

Technical requirements

Scaling with Spark

Spinning up serverless infrastructure

Containerizing at scale with Kubernetes

Scaling with Ray

Designing systems at scale

Summary

Deep Learning, Generative AI, and LLMOps

Going deep with deep learning

Living it large with LLMs

Building the future with LLMOps

Summary

Building an Example ML Microservice

Technical requirements

Understanding the forecasting problem

Designing our forecasting service

Selecting the tools

Training at scale

Serving the models with FastAPI

Containerizing and deploying to Kubernetes

Summary

Building an Extract, Transform, Machine Learning Use Case

Technical requirements

Understanding the batch processing problem

Designing an ETML solution

Selecting the tools

Executing the build

Summary

Other Books You May Enjoy

Index

Designing an ETML solution

The requirements clearly point us to a solution that takes in some data and augments it with ML inference before writing the result to a target location. Any design we come up with must encapsulate these steps. This describes any Extract, Transform, Machine Learning (ETML) solution, one of the most widely used patterns in the ML world. In my opinion, it will remain important for a long time to come, as it is particularly suited to ML applications where:

Latency is not critical: If you can afford to run on a schedule and there are no high-throughput or low-latency response time requirements, then running as an ETML batch is perfectly acceptable.
You need to batch the data for algorithmic reasons: A great example of this is the clustering approach we will use here. There are ways to perform clustering in an online setting, where the model is continually updated as new data comes in, but some approaches are simpler if you have all the relevant data taken together...
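
To make the shape of the pattern concrete, here is a minimal sketch of an ETML batch run using only the Python standard library. It is not the implementation built in this chapter: the column names, the in-memory CSV, the function names, and the tiny k-means routine are all assumptions chosen for illustration, and a real solution would read from and write to proper storage and use a tested clustering library.

```python
import csv
import io
import json

# Hypothetical input batch; a real job would read this from a file or object store.
RAW_CSV = """ride_id,distance_km,duration_min
a,1.0,5
b,1.2,6
c,9.8,30
d,10.1,32
"""


def extract(raw_text):
    """Extract: parse the raw CSV batch into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def transform(rows):
    """Transform: cast the numeric columns into feature tuples."""
    return [(float(r["distance_km"]), float(r["duration_min"])) for r in rows]


def assign(point, centroids):
    """Return the index of the centroid closest to the point."""
    dists = [sum((p - c) ** 2 for p, c in zip(point, centroid))
             for centroid in centroids]
    return dists.index(min(dists))


def cluster(features, k=2, n_iter=10):
    """ML step: a toy k-means over the whole batch (illustrative stand-in)."""
    centroids = [list(f) for f in features[:k]]  # deterministic initialization
    labels = [0] * len(features)
    for _ in range(n_iter):
        labels = [assign(f, centroids) for f in features]
        for j in range(k):
            members = [f for f, lab in zip(features, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def load(rows, labels):
    """Load: attach the cluster label to each row and serialize the batch."""
    enriched = [dict(row, cluster=label) for row, label in zip(rows, labels)]
    return json.dumps(enriched)


def run_etml(raw_text):
    """Run the extract, transform, ML, and load steps once over one batch."""
    rows = extract(raw_text)
    labels = cluster(transform(rows))
    return load(rows, labels)


if __name__ == "__main__":
    print(run_etml(RAW_CSV))
```

Because the clustering step sees the whole batch at once, the entry point can simply be called on a schedule, which is the situation both points above describe.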
