-
Book Overview & Buying
-
Table Of Contents
Machine Learning Infrastructure and Best Practices for Software Engineers
By :
Machine Learning Infrastructure and Best Practices for Software Engineers
By:
Overview of this book
Although creating a machine learning pipeline or developing a working prototype of a software system from that pipeline is easy and straightforward nowadays, the journey toward a professional software system is still extensive. This book will help you get to grips with various best practices and recipes that will help software engineers transform prototype pipelines into complete software products.
The book begins by introducing the main concepts of professional software systems that leverage machine learning at their core. As you progress, you’ll explore the differences between traditional, non-ML software, and machine learning software. The initial best practices will guide you in determining the type of software you need for your product. Subsequently, you will delve into algorithms, covering their selection, development, and testing before exploring the intricacies of the infrastructure for machine learning systems by defining best practices for identifying the right data source and ensuring its quality.
Towards the end, you’ll address the most challenging aspect of large-scale machine learning systems – ethics. By exploring and defining best practices for assessing ethical risks and strategies for mitigation, you will conclude the book where it all began – large-scale machine learning software.
Table of Contents (24 chapters)
Preface
Part 1:Machine Learning Landscape in Software Engineering
Machine Learning Compared to Traditional Software
Elements of a Machine Learning System
Data in Software Systems – Text, Images, Code, and Their Annotations
Data Acquisition, Data Quality, and Noise
Quantifying and Improving Data Properties
Part 2: Data Acquisition and Management
Processing Data in Machine Learning Systems
Feature Engineering for Numerical and Image Data
Feature Engineering for Natural Language Data
Part 3: Design and Development of ML Systems
Types of Machine Learning Systems – Feature-Based and Raw Data-Based (Deep Learning)
Training and Evaluating Classical Machine Learning Systems and Neural Networks
Training and Evaluation of Advanced ML Algorithms – GPT and Autoencoders
Designing Machine Learning Pipelines (MLOps) and Their Testing
Designing and Implementing Large-Scale, Robust ML Software
Part 4: Ethical Aspects of Data Management and ML System Development
Ethics in Data Acquisition and Management
Ethics in Machine Learning Systems
Integrating ML Systems in Ecosystems
Summary and Where to Go Next
Index