PostgreSQL 12 High Availability Cookbook - Third Edition

By: Shaun Thomas

Overview of this book

Databases are nothing without the data they store. In the event of an outage or technical catastrophe, immediate recovery is essential. This updated edition ensures that you will learn the important concepts related to node architecture design, as well as techniques such as using repmgr for failover automation. From cluster layout and hardware selection to software stacks and horizontal scalability, this PostgreSQL cookbook will help you build a PostgreSQL cluster that will survive crashes, resist data corruption, and grow smoothly with customer demand. You’ll start by understanding how to plan a PostgreSQL database architecture that is resistant to outages and scalable, as it is the scaffolding on which everything rests. With the bedrock established, you'll cover the topics that PostgreSQL database administrators need to know to manage a highly available cluster. This includes configuration, troubleshooting, monitoring and alerting, backups through proxies, failover automation, and other considerations that are essential for a healthy PostgreSQL cluster. Later, you’ll learn to use multi-master replication to maximize server availability. Later chapters will guide you through managing major version upgrades without downtime. By the end of this book, you’ll have learned how to build an efficient and adaptive PostgreSQL 12 database cluster.

Preface

Who this book is for

What this book covers

To get the most out of this book

Sections

Get in touch

Architectural Considerations

Setting expectations with RPO

Defining timetables through RTO

Picking redundant copies

Selecting locations

Having enough backups

Considering quorum

Introducing indirection

Preventing split brain

Incorporating multi-master

Leveraging multi-master

Hardware Planning

Planning for redundancy

Allocating enough memory

Exploring nimble networking

Managing motherboards

Minimizing Downtime

Determining acceptable losses

Configuration – getting it right the first time

Configuration – managing scary settings

Identifying important tables

Defusing cache poisoning

Terminating rogue connections

Reducing contention with concurrent indexes

Managing system migrations

Managing software upgrades

Mitigating the impact of hardware failure

Applying bonus kernel tweaks

Proxy and Pooling Resources

Exploring the magic of virtual IPs

Obtaining and installing HAProxy

Configuring HAProxy to load balance PostgreSQL

Determining connection costs and limits

Installing PgBouncer

Configuring PgBouncer safely

Connecting to PgBouncer

Listing PgBouncer server connections

Listing PgBouncer client connections

Evaluating PgBouncer pool health

Changing PgBouncer connections while online

Enhancing PgBouncer authentication

Troubleshooting

Performing triage

Installing common statistics packages

Evaluating the current disk performance with iostat

Tracking I/O-heavy processes with iotop

Viewing past performance with sar

Correlating performance with dstat

Interpreting /proc/meminfo

Examining /proc/net/bonding/bond0

Checking the pg_stat_activity view

Checking the pg_stat_statements view

Deciphering database locks

Debugging with strace

Logging checkpoints properly

Monitoring

Figuring out what to monitor

Installing and configuring Nagios

Configuring Nagios to monitor a database host

Enhancing Nagios with Check_MK

Getting to know check_postgres

Installing and configuring Telegraf

Adding a custom PostgreSQL monitor to Telegraf

Installing and configuring InfluxDB

Installing and configuring Grafana

Building a graph in Grafana

Customizing a Grafana graph

Using InfluxDB tags in Grafana

PostgreSQL Replication

Deciding what to copy

Securing the WAL stream

Setting up a hot standby

Upgrading to asynchronous replication

Bulletproofing with synchronous replication

Faking replication with pg_receivewal

Setting up Slony

Copying a few tables with Slony

Setting up Bucardo

Copying a few tables with Bucardo

Setting up pglogical

Copying a few tables with pglogical

Copying a few tables with native logical replication

Backup Management

Deciding when to use third-party tools

Installing and configuring Barman

Backing up a database with Barman

Restoring a database with Barman

Obtaining Barman diagnostics and information

Sending Barman backups to a remote location

Installing and configuring pgBackRest

Backing up a database with pgBackRest

Restoring a database with pgBackRest

Installing and configuring WAL-E

Managing WAL files with WAL-E

High Availability with repmgr

Preparing systems for repmgr

Installing and configuring repmgr

Cloning a database with repmgr

Incorporating a repmgr witness

Performing a managed failover

Customizing the failover process

Using an outage to test availability

Returning a node to the cluster

Integrating primary fencing

Performing online maintenance and upgrades

High Availability with Patroni

Understanding more about Patroni and its components

Preparing systems for the stack

Installing and configuring etcd

Installing and configuring Patroni

Installing and configuring HAProxy

Performing a managed switchover

Using an outage to test availability

Returning a node to the cluster

Adding additional nodes to the mix

Replacing etcd with ZooKeeper

Replacing etcd with Consul

Upgrading while staying online

Low-Level Server Mirroring

Understanding our chosen filesystem components

Preparing systems for volume mirroring

Getting started with the LVM

Adding block-level replication

Incorporating the second LVM layer

Verifying a DRBD filesystem

Correcting a DRBD split brain

Formatting an XFS filesystem

Tweaking XFS performance

Maintaining an XFS filesystem

Using LVM snapshots

Switching live stack systems

Detaching a problematic node

High Availability via Pacemaker

Before we begin...

Installing the components

Configuring Corosync

Preparing start up services

Starting with base options

Adding DRBD to cluster management

Adding LVM to cluster management

Adding XFS to cluster management

Adding PostgreSQL to cluster management

Adding a virtual IP to proxy the cluster

Adding an email alert

Grouping associated resources

Combining and ordering related actions

Performing a managed resource migration

Using an outage to test migration

High Availability with Multi-Master Replication

Overview of multi-master

Deciding whether multi-master is right for you

Obtaining and installing BDR

Starting with a single BDR node

Creating an additional BDR node

Testing DDL replication on each node

Using sequences safely

Configuring HAProxy for the multi-master approach

Combining PgBouncer with HAProxy

Performing a managed node switchover

Improving failover speed

Performing a major version upgrade online

Data Distribution

Identifying horizontal candidates

Setting up a foreign PostgreSQL server

Mapping a remote user

Creating a foreign table

Using a foreign table in a query

Optimizing foreign table access

Transforming foreign tables into local tables

Creating a scalable nextval replacement

Building a sharding API

Talking to the correct shard

Moving a shard to another server

Zero-downtime Upgrades

Preparing upgrade requirements

Remembering PgBouncer and pglogical

Creating a publication set

Handling sequences

Bootstrapping the target cluster

Starting the subscription

Monitoring progress

Switching targets

Cleaning everything up

Other Books You May Enjoy

Leave a review - let other readers know what you think

Defining timetables through RTO

Like RPO, RTO is a common business continuity term; it stands for Recovery Time Objective. In practice, this is the maximum amount of time an outage of the database layer may last. Often, it is incorporated into a Service Level Agreement (SLA) contract presented to clients, or assumed as a metric within the application stack. Like RPO, this is a contractual-level element that can determine the number of required nodes, at steadily increasing expense as the amount of tolerable downtime decreases.

In this recipe, we will examine the steps necessary to define a realistic RTO, and what that could mean given known industry standards.

Getting ready

As with RPO, our goal in determining a functional RTO is to set expectations regarding inherent architecture limitations. The primary difference here is that RTO is more easily quantifiable. Fire up your favorite spreadsheet program, such as OpenOffice, Microsoft Excel, or Google Sheets; we'll be using it to keep track of how much time each layer of the application, including the database layer, contributes to a potential outage scenario.

How to do it...

We simply need to produce a spreadsheet to track all of the elements of known RTO that depend on the database. We can do this with the following steps:

1. Locate an already-defined RTO SLA for each portion of the application dependent on PostgreSQL if possible.
2. If this does not exist, seek the input of major decision makers:

    VP and C-level executives involved with technology
    Product manager
    Application designers and architects
    Infrastructure team lead

3. Find an amount of time that will satisfy most or all of the above.
4. Create a new spreadsheet for RTO.
5. Create a heading row with the following columns:

    Activity
    Time (seconds)
    Count
    Total (seconds)

6. In the Total column, create the following formula:

    =B2*C2

7. Create one row for each type of the following Activity categories:

    Minor Upgrade
    Major Upgrade
    Reboot
    Switchover
    Failover
    OS Upgrade
    Etc.

8. Copy and paste the formula into the Total column for all the rows we created.
9. At the bottom of the Total column, after all relevant rows (row 21, for example), create the following formula:

    =SUM(D2:D20)

10. Ensure that the end result looks something like the following screenshot:
11. Follow the rest of the advice in this chapter to find a suitable architecture.
12. Try to determine a rough cost for this and the closest alternative(s).
13. Present the design and cost estimates to decision makers.
14. Document this final RTO decision and architecture as reference material.
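
For readers who prefer code to a spreadsheet, the same tally can be sketched in a few lines of Python. This is only an illustration: the Activity class, the cumulative_rto helper, and every time and count value below are hypothetical placeholders, not figures from this recipe.

```python
from dataclasses import dataclass


@dataclass
class Activity:
    """One spreadsheet row: an activity, its outage time, and its yearly count."""
    name: str
    seconds: float  # "Time (seconds)" column
    count: int      # "Count" column: occurrences per year

    @property
    def total(self) -> float:
        # Equivalent to the =B2*C2 formula in the Total column
        return self.seconds * self.count


def cumulative_rto(rows):
    """Equivalent to the =SUM(D2:D20) formula at the bottom of the Total column."""
    return sum(row.total for row in rows)


# Hypothetical values for illustration only; substitute real measurements.
rows = [
    Activity("Minor Upgrade", 30, 4),
    Activity("Major Upgrade", 600, 1),
    Activity("Reboot", 120, 2),
    Activity("Switchover", 10, 2),
    Activity("Failover", 60, 2),
    Activity("OS Upgrade", 300, 1),
]

for row in rows:
    print(f"{row.name:<15}{row.seconds:>8}{row.count:>6}{row.total:>8}")
print(f"{'Total':<15}{cumulative_rto(rows):>22}")
```

Keeping one such list per candidate architecture makes it easy to rerun the tally whenever a Time or Count estimate changes.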

How it works...

In order to see where our PostgreSQL cluster fits company expectations, we need to know whether the company and each individual part of the existing application stack has an overall target RTO. If it doesn't, it's our job to approximate one. This means contacting any decision-makers, product owners, architects, and so on, to know what RTO target we're trying to attain and how other resources may contribute. These will act as a type of maximum value we can't exceed.

Keep in mind that RTO values tend to be amplified between layers. If our RTO is higher than some portion of the application stack, that will necessarily raise the RTO of that layer as well, which may increase the RTO of each subsequent layer. This is the exact scenario we're trying to avoid.

Once we have an RTO expectation, we need to examine how possible it is to fall under that target. The easiest way to accomplish this is to build a spreadsheet that essentially consists of a list of dependencies, maintenance tasks, or other occurrences related to PostgreSQL.

The rows we used for Activity are mainly suggestions, and producing an exhaustive list depends on the architecture to a certain extent. However, all software requires upgrades, machines need to be rebooted, switchover tests may be required to prove high availability functionality, past experience with the full application stack and hardware may imply two unexpected outages per year, and so on. Each of these will contribute to the cumulative RTO for PostgreSQL, which we can use as a reference value.

The number we use for the Count column should be the number of times the Activity happens on a yearly basis. As an example, PostgreSQL has a quarterly release schedule for non-critical bug fixes and security updates. If you want to keep up with these, it could make sense to set the Count column of Minor Upgrade to 4.

A number of architectural examples that we'll discuss later in this chapter will make it possible to set the Time column to 0 for some actions, or at least to a much lower value. We'll discuss these where relevant. This is also one of the reasons we'll need to execute this recipe multiple times when deciding on an appropriate architecture.

Once we have accounted for as many foreseeable Activity components as may be necessary over the course of a year, we'll have a cumulative total that may represent the RTO that PostgreSQL can achieve for a given architecture. As a sanity check, we should compare that value to the lowest RTO for any parts of the application stack that depend on PostgreSQL. It's important we don't exceed this target.
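
The sanity check amounts to comparing the cumulative total against the strictest target among the dependent layers. A minimal sketch, assuming hypothetical layer names, hypothetical target values, and a hypothetical cumulative total of 1,400 seconds:

```python
def fits_target(cumulative_seconds, layer_targets):
    """Return (ok, strictest), where strictest is the lowest dependent-layer RTO."""
    strictest = min(layer_targets.values())
    return cumulative_seconds <= strictest, strictest


# Hypothetical yearly RTO targets, in seconds, for layers that depend on PostgreSQL.
layer_targets = {
    "web frontend": 4 * 3600,
    "billing API": 1 * 3600,
    "reporting": 8 * 3600,
}

# 1400 is a made-up cumulative total standing in for the spreadsheet's SUM cell.
ok, strictest = fits_target(1400, layer_targets)
print(f"strictest target: {strictest}s, within target: {ok}")
```

If the check fails, either the architecture needs to change so that some Time values drop, or the target itself needs to be renegotiated with the decision-makers.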

Then, as with RPO, we need to present the possible RTO to decision-makers so that it can be integrated into the overall company RTO. To do that, we must continue with the rest of the chapter to find one or two architectures with either higher or lower expected RTO, estimate the cost of each, and work on a suitable compromise.

Deriving an appropriate RTO may require multiple iterations of this recipe, from estimation and architecture selection to presenting it to the appropriate parties, and so on. This isn't a fast or simple process, and it pays to get it right early. We need to know how many PostgreSQL nodes to purchase, where each will reside, how we switch to alternatives, how much time each step may take, and so on.

There's more...

Besides what we discussed in the main recipe, there are other RTO concepts we would like to explore.

This may seem familiar

Believe it or not, it's very likely you've encountered this concept without even realizing it. Internet service providers or application hosts often advertise how many 9s of availability their platform can maintain. It's often presented as a chart like this:

Uptime (%)	Daily	Weekly	Monthly	Yearly
99	14m 24s	1h 40m 48s	7h 18m 18s	3d 15h 39m 30s
99.9	1m 26s	10m 5s	43m 50s	8h 45m 57s
99.99	8.6s	1m 1s	4m 23s	52m 36s
99.999	0.9s	6s	26.3s	5m 16s

As you can imagine, it's generally more desirable to stay toward the higher end of 9s to minimize downtime. On the other hand, this is highly restrictive, as five 9s allows just over five minutes of downtime over the course of an entire year. This doesn't leave much room for database maintenance tasks or unexpected outages at any other layer of the stack.
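
The figures in such a chart follow directly from the uptime percentage: the allowed downtime is the length of the period multiplied by the fraction of time the service may be unavailable. The following sketch shows the arithmetic. The function names are our own, and the period lengths assume an average year of 365.2425 days, so a result may differ from a published chart by a second or so depending on rounding.

```python
SECONDS_PER_DAY = 24 * 60 * 60

# Assumption: average period lengths based on a 365.2425-day year.
PERIOD_DAYS = {
    "Daily": 1,
    "Weekly": 7,
    "Monthly": 365.2425 / 12,
    "Yearly": 365.2425,
}


def allowed_downtime(uptime_percent, period_days):
    """Seconds of downtime permitted in a period at the given uptime percentage."""
    return period_days * SECONDS_PER_DAY * (100 - uptime_percent) / 100


def humanize(seconds):
    """Format seconds as d/h/m/s, dropping leading units that are zero."""
    if seconds < 60:
        return f"{seconds:.1f}s"
    total = round(seconds)
    days, rem = divmod(total, 86400)
    hours, rem = divmod(rem, 3600)
    minutes, secs = divmod(rem, 60)
    parts = [(days, "d"), (hours, "h"), (minutes, "m"), (secs, "s")]
    while parts and parts[0][0] == 0:
        parts.pop(0)
    return " ".join(f"{value}{unit}" for value, unit in parts)


print("Uptime (%)", *PERIOD_DAYS, sep="\t")
for pct in (99, 99.9, 99.99, 99.999):
    row = [humanize(allowed_downtime(pct, days)) for days in PERIOD_DAYS.values()]
    print(pct, *row, sep="\t")
```

Comparing the yearly figure for a desired number of 9s against the cumulative total from the RTO spreadsheet gives a quick sense of which availability tier a given architecture could realistically claim.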

Node counts

Generally, the more nodes we have, the lower our RTO will be. It may make sense to start with an initial estimate spreadsheet, and then create another for each architecture or variant that seems applicable. This makes it easier to rank the monetary cost and associated RTO of each, to keep track of which options we have, and ultimately to inform the final decision.
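
Once each variant has its own spreadsheet total, ranking them is straightforward. The sketch below is one possible approach rather than a prescribed method: the Candidate type, the architecture names, the costs, and the RTO totals are all hypothetical.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    name: str
    nodes: int
    yearly_cost: float   # hypothetical currency units
    rto_seconds: float   # cumulative yearly total from that variant's spreadsheet


def rank(candidates, target_seconds):
    """Keep the candidates that meet the target RTO, cheapest first."""
    viable = [c for c in candidates if c.rto_seconds <= target_seconds]
    return sorted(viable, key=lambda c: (c.yearly_cost, c.rto_seconds))


# Made-up numbers for illustration only.
candidates = [
    Candidate("single node", 1, 10_000, 7_200),
    Candidate("primary + standby", 2, 20_000, 1_400),
    Candidate("primary + two standbys", 3, 30_000, 600),
]

for c in rank(candidates, target_seconds=3_600):
    print(f"{c.name}: {c.nodes} nodes, cost {c.yearly_cost}, RTO {c.rto_seconds}s")
```

Sorting by cost first reflects the idea that the cheapest architecture meeting the target is usually the starting point for discussion; decision makers may still prefer a costlier option with more headroom.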
