Book Image

Google Cloud Platform for Architects

By : Vitthal Srinivasan, Loonycorn , Judy Raj

Book Image

Google Cloud Platform for Architects

By: Vitthal Srinivasan, Loonycorn , Judy Raj

Overview of this book

Using a public cloud platform was considered risky a decade ago, and unconventional even just a few years ago. Today, however, use of the public cloud is completely mainstream - the norm, rather than the exception. Several leading technology firms, including Google, have built sophisticated cloud platforms, and are locked in a fierce competition for market share. The main goal of this book is to enable you to get the best out of the GCP, and to use it with confidence and competence. You will learn why cloud architectures take the forms that they do, and this will help you become a skilled high-level cloud architect. You will also learn how individual cloud services are configured and used, so that you are never intimidated at having to build it yourself. You will also learn the right way and the right situation in which to use the important GCP services. By the end of this book, you will be able to make the most out of Google Cloud Platform design.

Preface

Who this book is for

What this book covers

To get the most out of this book

Free Chapter

The Case for Cloud Computing

The Case for Cloud Computing

Why Google Cloud Platform (GCP)?

Autoscaling and autohealing

Capital expenditure (CAPEX) versus operating expenses (OPEX)

Career implications

Introduction to Google Cloud Platform

Introduction to Google Cloud Platform

Global, regional, and zonal resources

Accessing the Google Cloud Platform

Projects and billing

Setting up a GCP account

Using the Cloud Shell

Compute Choices – VMs and the Google Compute Engine

Compute Choices – VMs and the Google Compute Engine

Google Compute Engine – GCE

Persistent disks and local SSDs – block storage for GCE

More on working with GCE VMs

Modifying GCE VMs

GKE, App Engine, and Cloud Functions

GKE, App Engine, and Cloud Functions

Creating a Kubernetes cluster and deploying a WordPress container

Using the features of GKE

Google App Engine – flexible

Google App Engine – standard

Google Cloud Storage – Fishing in a Bucket

Google Cloud Storage – Fishing in a Bucket

Knowing when (and when not) to use GCS

Serving Static Content with GCS Buckets

Storage classes–Regional, multi-regional, nearline, and coldline

Working with GCS buckets

Creating buckets

Transferring data in and out of buckets

Use case – Object Versioning

Use case – object life cycle policies

Use case – restricting access with both ACLs and IAM

Use case – signed and timed URLs

Use case – reacting to object changes

Use case – using customer supplied encryption keys

Use case – auto-syncing folders

Use case – mounting GCS using gcsfuse

Use case – offline ingestion options

Relational Databases

Relational Databases

Relational databases, SQL, and schemas

Use case – managing replicas

Use case – managing certificates

Use case – operating Cloud SQL through VM instances

Automatic backup and restore

NoSQL Databases

NoSQL Databases

NoSQL databases

Creating and operating an HBase table using Cloud Bigtable

Scaling GCP Cloud BigTable

The Google Cloud Datastore

Comparison with traditional databases

Working with Datastore

Full indexing and perfect index

BigQuery

Underlying data representation of BigQuery

BigQuery public datasets

Legacy versus standard SQL

Working with the BigQuery console

Loading data into a table using BigQuery

Deleting datasets

Working with BigQuery using CLI

BigQuery pricing

Analyzing financial time series with BigQuery

Identity and Access Management

Identity and Access Management

Resource hierarchy of GCP

Permissions and roles

Managing Hadoop with Dataproc

Managing Hadoop with Dataproc

Hadoop and Spark

Hadoop on the cloud

Google Cloud Dataproc

Compute options for Dataproc

Working with Dataproc

Load Balancing

Why load balancers matter now

Taxonomy of GCP load balancers

HTTP(S) load balancing

Configuring HTTP(S) load balancing

Configuring Internal Load Balancing

Other load balancing

Networking in GCP

Networking in GCP

Why GCP's networking model is unique

VPC networks and subnets

The default VPC

Internal and external IP addresses

VPN and cloud router

Working with VPCs

Working with custom subnets

Working with firewall rules

Logging and Monitoring

Logging and Monitoring

Infrastructure Automation

Infrastructure Automation

Managed Instance Groups

Cloud deployment manager

Security on the GCP

Security on the GCP

Security features at Google and on the GCP

Google-provided tools and options for security

Some security best practices

BeyondCorp – Identity-Aware Proxy

Pricing Considerations

Pricing Considerations

Google Kubernetes Engine

Cloud ML Engine

Video Intelligence API

Key Management Service – KMS

Effective Use of the GCP

Effective Use of the GCP

Eat the Kubernetes frog

Careful that you don't get nickel-and-dimed

Pay for what you allocate not what you use

Make friends with the gsuite admins

Try to find reasons to use network peering

Understand how sustained use discounts work

Read the fine print on GCS pricing

Use BigQuery unless you have a specific reason not to

Use pre-emptible instances in your Dataproc clusters

Keep your Dataproc clusters stateless

Understand the unified architecture for batch and stream

Understand the main choices for ML applications

Understand the differences between snapshots and images

Don't be Milton!

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Autoscaling and autohealing

The technical rationale for moving to the cloud can often be summed up in two words—autoscaling and autohealing.

Autoscaling: The idea of autoscaling is simple enough although the implementations can get quite involved—apps are deployed on compute, the amount of compute capacity increases or decreases depending on the level of incoming client requests. In a nutshell, all the public cloud providers have services that make autoscaling and autohealing easily available. Autoscaling, in particular, is a huge deal. Imagine a large Hadoop cluster, with say 1,000 nodes. Try scaling that; it probably is a matter of weeks or even months. You'd need to get and configure the machines, reshard the data and jump through a trillion hoops. With a cloud provider, you'd simply use an elastic version of Hadoop such as Dataproc on the GCP or Elastic MapReduce (EMR) on AWS and you'd be in business in minutes. This is not some marketing or sales spiel; the speed of scaling up and down on the cloud is just insane.

Here’s a little rhyme to help you remember the main point of our conversation here—we’ll keep using them throughout the remainder of the book just to mix things up a bit. Oh, and they might sometimes introduce a few new terms or ideas that will be covered at length in the following sections, so don’t let any forward references bother you just yet!

Autohealing: The idea of autohealing is just as important as that of autoscaling, but it is less explicitly understood. Let's say that we deploy an app that could be a Java JAR, Python package, or Docker container to a set of compute resources, which again could be cloud VMs, App Engine backends, or pods in a Kubernetes cluster. Those compute resources will have problems from time to time; they will crash, hang, run out of memory, throw exceptions, and misbehave in all kinds of unpredictable ways. If we did nothing about these problems, those compute resources would effectively be out of action, and our total compute capacity would fall and, sooner or later, become insufficient to meet client requests. So, clearly, we need to somehow detect whether our compute resources got sick, and then heal them. In the pre-cloud days, this would have been pretty manual, some poor sap of an engineer would have to nurse a bare metal or VM back to health. Now, with cloud-based abstractions, individual compute units are much more expendable. We can just take them down and replace them with new ones. Because these units of compute capacity are interchangeable (or fungible—a fancier word that means the same thing), autohealing is now possible: