Book Image

Practical Machine Learning on Databricks

By : Debu Sinha
Book Image

Practical Machine Learning on Databricks

By: Debu Sinha

Overview of this book

Unleash the potential of databricks for end-to-end machine learning with this comprehensive guide, tailored for experienced data scientists and developers transitioning from DIY or other cloud platforms. Building on a strong foundation in Python, Practical Machine Learning on Databricks serves as your roadmap from development to production, covering all intermediary steps using the databricks platform. You’ll start with an overview of machine learning applications, databricks platform features, and MLflow. Next, you’ll dive into data preparation, model selection, and training essentials and discover the power of databricks feature store for precomputing feature tables. You’ll also learn to kickstart your projects using databricks AutoML and automate retraining and deployment through databricks workflows. By the end of this book, you’ll have mastered MLflow for experiment tracking, collaboration, and advanced use cases like model interpretability and governance. The book is enriched with hands-on example code at every step. While primarily focused on generally available features, the book equips you to easily adapt to future innovations in machine learning, databricks, and MLflow.
Table of Contents (16 chapters)
1
Part 1: Introduction
4
Part 2: ML Pipeline Components and Implementation
8
Part 3: ML Governance and Deployment

Part 1: Introduction

This part mainly focuses on data science use cases, the life cycle of and personas involved in a data science project (data engineers, analysts, and scientists), and the challenges of ML development in organizations.

This section has the following chapters: