Book Image

Machine Learning with BigQuery ML

By : Alessandro Marrandino
Book Image

Machine Learning with BigQuery ML

By: Alessandro Marrandino

Overview of this book

BigQuery ML enables you to easily build machine learning (ML) models with SQL without much coding. This book will help you to accelerate the development and deployment of ML models with BigQuery ML. The book starts with a quick overview of Google Cloud and BigQuery architecture. You'll then learn how to configure a Google Cloud project, understand the architectural components and capabilities of BigQuery, and find out how to build ML models with BigQuery ML. The book teaches you how to use ML using SQL on BigQuery. You'll analyze the key phases of a ML model's lifecycle and get to grips with the SQL statements used to train, evaluate, test, and use a model. As you advance, you'll build a series of use cases by applying different ML techniques such as linear regression, binary and multiclass logistic regression, k-means, ARIMA time series, deep neural networks, and XGBoost using practical use cases. Moving on, you'll cover matrix factorization and deep neural networks using BigQuery ML's capabilities. Finally, you'll explore the integration of BigQuery ML with other Google Cloud Platform components such as AI Platform Notebooks and TensorFlow along with discovering best practices and tips and tricks for hyperparameter tuning and performance enhancement. By the end of this BigQuery book, you'll be able to build and evaluate your own ML models with BigQuery ML.
Table of Contents (20 chapters)
1
Section 1: Introduction and Environment Setup
5
Section 2: Deep Learning Networks
9
Section 3: Advanced Models with BigQuery ML
15
Section 4: Further Extending Your ML Capabilities with GCP

Configuring BigQuery Flex Slots

In this chapter, we'll understand how to configure BigQuery Flex Slots to train our ML model.

A BigQuery Slot is a unit of BigQuery analytics capacity that's used to execute SQL queries and train BigQuery ML models. One BigQuery slot represents the compute capacity of a Virtual Compute Processing Unit (VCPU).

Flex Slots allow us to buy BigQuery analytics capacity for short periods. They are usually used to quickly satisfy sudden demands for resources with a minimum duration of 60 seconds.

Enabling Flex Slots is mandatory to train a matrix factorization model; otherwise, BigQuery will return an error during the training stage.

Let's see how we can enable BigQuery Flex Slots if we're using an on-demand plan:

  1. If you haven't enabled BigQuery Reservations yet, we need to access Reservations from the BigQuery menu on the left:

    Figure 9.3 – Accessing Reservations from the BigQuery navigation menu

  2. Click...